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Abstract 

We provide an assessment of the state of the art in various issues related to exper- 
imental measurements, phenomenological methods and theoretical results relevant 
for the determination of partem distribution functions (PDFs) and their uncertainties, 
with the specific aim of providing benchmarks of different existing approaches and 
results in view of their application to physics at the LHC. 

We discuss higher order corrections, we review and compare different approaches 
to small x resummation, and we assess the possible relevance of partem saturation 
in the determination of PDFS at HERA and its possible study in LHC processes. 
We provide various benchmarks of PDF fits, with the specific aim of studying is- 
sues of error propagation, non-gaussian uncertainties, choice of functional forms of 
PDFs, and combination of data from different experiments and different processes. 
We study the impact of combined HERA (ZEUS -HI) structure function data, their 
impact on PDF uncertainties, and their implications for the computation of standard 
candle processes, and we review the recent Fl determination at HERA. Finally, 
we compare and assess methods for luminosity measurements at the LHC and the 
impact of PDFs on them. 
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1 INTRODUCTION 



With the start of data-taking at the LHC getting closer, the importance of a detailed understanding of the 
physics of parton distributions (PDFs) has increased considerably, along with the awareness of the LHC com- 
munity for the importance of the issues related to it. Clearly, the main reason why PDFs are important at the 
LHC is that at a hadron collider a detailed understanding of PDFs is needed in order to obtain accurate predic- 
tions for both signal and background processes. Indeed, for many physical processes at the LHC, PDFs are the 
dominant source of uncertainty. On the other hand, an accurate control of PDF uncertainties allows one to use 
selected processes as "standard candles", for instance in the determination of luminosities. However, this also 
means that experimentation at the LHC will provide a considerable amount of new experimental information 
on PDFs, and it will enable us to test the adequacy of their current theoretical understanding. 

The main aim of this document is to provide a state of the art assessment of our understanding of 
PDFs at the dawn of the LHC. Since the previous HERA-LHC workshop [1], we have witnessed several 
important directions of progress in the physics of PDFs. On the theoretical side there has been conclusive 
progress in extending the treatment of perturbative QCD beyond the current default, namely, the next-to- 
leading perturbative order. On the phenomenological side there has been a joint effort between experimental 
and theoretical groups involved in the extraction of PDFs, specifically from global fits, in agreeing on common 
procedures, benchmarks and standards. On the experimental side, new improved results from the HERA runs 
are being finalized: these include both the construction of a joint determination of structure function which 
combines the result of the ZEUS and HI experiments, and the first direct measurements of the structure 
function Fl which have been made possible by running HERA at a reduced proton beam energy in 2007. 
Also, the LHC experiments (ATLAS, CMS and LHCb) are now assessing the use of standard candle processes 
for luminosity measurements. 

All these issues are discussed in this document. In each case, our main goal has been to provide as much 
as possible a joint treatment by the various groups involved, as well as a comparison of different approaches 
and benchmarking of results. In particular, in Sect. |2j after briefly reviewing (Sect. 12.11 ) the current status of 
higher-order calculations for DIS, we provide (Sect. 12.21 ) detailed comparisons of techniques and results of 
different existing approaches to small x resummation, and then we summarize (Sect. 12.31) the current status 
of studies of parton saturation at HERA, their possible impact on current PDF extraction and the prospects of 
future studies at the LHC. In Sect. [3] we discuss methods and results for the benchmarking of PDF fits: with 
specific reference to two benchmark fits based on a common agreed set of data, we discuss issues related to 
error propagation and non-gaussian errors, to the choice of functional form and corresponding bias, to possible 
incompatibilities between different data sets. In Sect. @] we turn to recent progress in the extraction of PDFs 
from HERA data, specifically the impact of combined ZEUS -HI structure function data on PDF determination 
and the ensuing calculation of W and Z cross-sections (Sect. 14.11) and the recent first determination of the 
structure function Fl (Sect. 14.21) . In Sect. [5] we discuss and compare luminosity measurements based on 
absolute proton-proton luminosity measurements to those based on the use of standard candle processes, and 
the impact on all of them of PDF uncertainties. Finally, in Sect. [6] we present the PDF4LHC initiative, which 
will provide a framework for the continuation of PDF studies for the LHC. 

Note: Most of the contributions to this workshop are the result of collaboration between various groups. 
The common set of authors given for each section or subsection has read and approved the entire content of 
that section or subsection; however, when a subset of these authors is given for a specific part of the section or 
subsection, they are responsible for it. 



2 THEORETICAL ISSUES 

2.1 Precision calculations for inclusive DIS: an update 20 

With high-precision data from HERA and in view of the outstanding importance of hard scattering cross 
sections at the LHC, a quantitative understanding of deep-inelastic processes is indispensable, necessitating 
calculations beyond the standard next-to-leading order of perturbative QCD. 

In this contribution we briefly discuss the recent extension of the three-loop calculations for inclusive 
deep-inelastic scattering (DIS) [2-9] to the complete set of coefficient functions for the charged-current (CC) 
case. The new third-order expressions are too lengthy for this short overview. They can be found in Refs. [10, 
11] together with the calculational methods and a more detailed discussion. Furthermore the reader is referred 
to Refs. [12, 13] for our first results on the three-loop splitting functions for the evolution of helicity-dependent 
parton distributions. 

Structure functions in inclusive deep-inelastic scattering are among the most extensively measured ob- 
servables. The combined data from fixed-target experiments and the HERA collider spans about four orders 
of magnitude in both Bjorken-x variable and the scale Q 2 = —q 2 given by the momentum q of the exchanged 
electroweak gauge boson [14]. Here we consider the VF-exchange charged-current case, see Refs. [15-21] 
for recent data from neutrino DIS and HERA. With six structure functions, F^ , F^ and F™ , this case 
has a far richer structure than, for example, electromagnetic DIS with only two independent observables, F^ 
and Fl- 

Even taking into account a forthcoming combined Hl/ZEUS final high-C} 2 data set from HERA, more 
detailed measurements are required to fully exploit the resulting potential, for instance at a future neutrino 
factory, see Ref. [22], and the LHeC, the proposed high-luminosity electron-proton collider at the LHC [23]. 
Already now, however, CC DIS provides important information on the parton structure of the proton, e.g., 
its flavour decomposition and the valence-quark distributions. Moreover, present results are also sensitive to 
electroweak parameters of the Standard Model such as sin 2 Oyy, see Ref. [24], and the space-like VF-boson 
propagator [25]. As discussed, for example, in Refs. [26-29], a reliable determination of sin 2 dyy from neu- 
trino DIS requires a detailed understanding of non-perturbative and perturbative QCD effects. 

Previous complete results on unpolarized DIS include the three-loop splitting functions [5, 6] as well 
as the 3-loop coefficient functions for the photon-exchange structure functions F<i,l [7, 8]. However, most 
coefficient functions for CC DIS were not fully computed to three loops so far. 

For this case it is convenient to consider linear combinations of the structure functions F^ v± with simple 
properties under crossing, such as Fa P±up (a = 2, 3, L) for neutrino DIS. For all these combinations either the 
even or odd moments can be calculated in Mellin-A r space in the framework of the operator product expansion 
(OPE), see Ref. [30]. The results for the third-order coefficient functions for the even-N combinations F^ p jJ vv 
can be taken over from electromagnetic DIS [7, 8]. Also the coefficient function for the odd-iV based charged- 
current structure function F^ p+up is completely known at three-loop accuracy, with the results only published 
via compact parameterizations so far [9]. For the remaining combinations p^P~ v P an d F^ p ~ up , on the other 
hand, only recently the first six odd or even integer moments of the respective coefficient functions have been 
calculated to third order in Ref. [10] following the approach of Refs. [2-4] based on the MINCER program 
[31,32]. 

The complete results of Refs. [7-9] fix all even and odd moments N. Hence already the present knowl- 
edge of fixed Mellin moments for F^ p ~ up and F^ p ~ vp is sufficient to determine also the lowest six moments 
of the differences of corresponding even-iV and odd-iV coefficient functions and to address a theoretical con- 
jecture [33] for these quantities, see Ref. [11]. Furthermore these moments facilitate x-space approximations 
in the style of, e.g, Ref. [34] which are sufficient for most phenomenological purposes, including the determi- 
nation of the third-order QCD corrections to the Paschos-Wolfenstein relation [35] used for the extraction of 
sin 2 Ow from neutrino DIS. 

The even-odd differences of the CC coefficient functions C a for a = 2, 3, L can be defined by 

5C 2 , L = C 2 "£ +Pp - C» p - 9p , 5C 3 = c »v-vp _ c vp+v p _ (1) 
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The signs are chosen such that the differences are always 'even - odd' in the moments N accessible by the 
OPE [30], and it is understood that the d abc d a bc part of C^ p+up [4,9] is removed before the difference is 
formed. With a s = q s /(47t) these non-singlet quantities can be expanded as 

5C a = aj<*4°- (2) 

1=2 

There are no first-order contributions to these differences, hence the above sums start at I = 2 . 

We start the illustration of these recent results by looking at the approximations for the up — up odd- 
N coefficient functions c£ L (x) (see Ref. [11] for a detailed discussion). These are compared in Fig. Q] to 
their exact counterparts [7, 8] for the even-iV non-singlet structure functions. The dashed lines represent the 
uncertainty band due to the limited number of known moments. The third-order even-odd differences remain 
noticeable to larger values of x than at two loops, e.g., up to x ~ 0.3 for F2 and x ~ 0.6 for Fl for the four- 
flavour case shown in the figure. The moments N = 1, 3, . . . , 9 constrain 8c^\{x) very well at x ^ 0.1, 
and approximately down to x ps 10~ 2 . 




-3 -2 -1 -3 -2 -1 

10 10 10 __ 10 10 10 

A. A. 

Fig. 1: The exact third-order coefficient functions of the even-iV structure functions F^ v ^ vv for four massless flavours, and the 
approximate odd-moment quantities for up — up combination. 



Concerning low values of Bjorken-x one should recall that the uncertainty bands shown by the dashed 
lines in Fig. Q] do not directly indicate the range of applicability of these approximations, since the coefficient 
functions enter observables only via smoothening Mellin convolutions with non-perturbative initial distribu- 
tions. In Fig. [2] we therefore present the convolutions of all six third-order CC coefficient functions with a 
characteristic reference distribution. It turns out that the approximations of the previous figure can be suffi- 
cient down to values even below x = 10 -3 , which is amply sufficient for foreseeable applications to data. The 
uncertainty of 5clj (x), on the other hand, becomes relevant already at larger values, x < 1CT 2 , as the lowest 
calculated moment of this quantity, N = 2, has far less sensitivity to the behaviour at low x. 

The three-loop corrections to the non-singlet structure functions are rather small even well below the 
x- values shown in the figure - recall our small expansion parameter a s : the third-order coefficient are smaller 
by a factor 2.0 • 10~ 3 if the expansion is written in powers of a B . Their sharp rise for x — ► 1 is understood in 
terms of soft-gluon effects which can be effectively resummed, if required, to next-to-next-to-next-to-leading 
logarithmic accuracy [36]. Our even-odd differences 5c a (x), on the other hand, are irrelevant at x > 0.1 but 
have a sizeable impact at smaller x in particular on the corrections for F2 and Fl. The approximate results for 
Sea (x) facilitate a first assessment of the perturbative stability of the even-odd differences £[]). In Fig. [3] we 
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Fig. 2: Convolution of the six third-order CC coefficient functions for F2,3, l in up + up and up — up DIS with a schematic but 
typical non-singlet distribution/. All results have been normalized to/(i), suppressing the large but trivial variation of the absolute 
convolutions. 

illustrate the known two orders for F2 and Fl for a s = 0.25 and n f = 4 massless quark flavours, employing 
the same reference quark distribution as in Fig. 12 

Obviously our new a s 3 corrections are important wherever these coefficient-function differences are 
non-negligible. On the other hand, our results confirm that these quantities are very small, and thus relevant 
only when a high accuracy is required. These conditions are fulfilled for the calculation of QCD corrections 
for the so-called Paschos-Wolfenstein relation. This relation is defined in terms of a ratio of neutral-current 
and charged-current cross sections for neutrino-nucleon DIS [35], 



R- 



a(v„N 



(3) 



The asymmetry R~ directly measures sin 2 6w if the up and down valence quarks in the target carry equal 
momenta, and if the strange and heavy-quark sea distributions are charge symmetric. Beyond the leading 
order this asymmetry can be presented as an expansion in a s and inverse powers of the dominant isoscalar 
combination u~ + dr , where q~ = J Q dx x (q(x) — q(x)) is the second Mellin moment of the valence quark 

(3) 

distributions. Using the results for differences 5c a (%), a = 2, L, 3 one can present it in a numeric form, 



R = - — sin 6w + 

-- 



dr + c 



u + d 



7 2 A 2 

1 — - sin 6 W + I - - sin W 



8«s 

9 7T 



[l + 1.689 a s + (3.661 ±0.002) a 2 ] \ + O ({vT + (T)" 2 ) + 0(af) , 



(4) 



where the third term in the square brackets is determined by the a 3 corrections 5c*a\x), a = 2, L, 3. The 
perturbation series in the square brackets appears reasonably well convergent for relevant values of the strong 
coupling constant, with the known terms reading, e.g., 1 + 0.42 + 0.23 for a s = 0.25. Thus the a 2 and 
contributions correct the NLO estimate by 65% in this case. On the other hand, due to the small prefactor 
of this expansion, the new third-order term increases the complete curly bracket in Eq. (@]) by only about 
1%, which can therefore by considered as the new uncertainty of this quantity due to the truncation of the 
perturbative expansion. Consequently previous NLO estimates of the effect of, for instance, the (presumably 
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Fig. 3: The first two approximations, denoted by LO and NLO, of the differences (Q} for F2 and Fl in charged-current DIS. The 
results are shown for representative values of a a and n; after convolution with the reference distribution/(a-) also employed in Fig.[2] 
The dashed curves correspond to the two approximation uncertainties for the new a s 3 contributions. 



mainly non-perturbative, see Refs. [37-39]) charge asymmetry of the strange sea remain practically unaffected 
by higher-order corrections to the coefficient functions. 

To summarize, we have extended the fixed- N three-loop calculations of inclusive DIS [2-4] to all 
charged-current cases not covered by the full (all-iV) computations of Refs. [7-9]. The region of applicability 
of these new results is restricted to Bjorken-x values above about 10~ 3 , a range amply sufficiently for any 
fixed-target or collider measurements of those charged-current structure functions in the foreseeable future. 
Except for the longitudinal structure function Fl, the present coefficient functions are part of the next-to- 
next-to-next-to-leading order (N 3 LO) approximation of massless perturbative QCD. Analyses at this order are 
possible outside the small-a; region since the corresponding four-loop splitting functions will have a very small 
impact here, cf. Ref. [40]. 



2.2 Small x resummation 



21 



The splitting functions which govern the evolution of the parton distributions (PDFs), together with the hard 
cross sections which relate those partons to hadronic physical observables, are potentially unstable at high 
energy due to logarithmically enhanced contributions. In particular, parametrizing observables such as deep- 
inelastic structure (DIS) functions or Drell-Yan (DY) or Higgs production cross section in hadronic collisions 
in terms of a dimensionful scale Q 2 (photon virtuality or invariant mass of the final state in DIS and DY respec- 
tively) and a dimensionless ratio x (the Bjorken variable or ^- in DIS and DY respectively), when x — > there 
are logarithmically enhanced contributions to the perturbation expansion of the form x~ 1 a s (Q 2 ) log m (l/x) 
(n > m— 1). When x is sufficiently small, one must resum such terms, reordering the perturbation expansion 
in terms of leading logarithmic (LL) terms followed by next-to-leading logarithmic (NLL) terms and so on. 

The problem can be traced to ladders of i-channel gluon exchanges at LL order, with some quark 
mixing at NLL order and beyond. The underlying framework for the resummation procedure is the BFKL 
equation [41,42], an integral equation for the unintegrated gluon f(k 2 , Qq) that is currently known up to full 
NLL order [43-45], and approximate NNLL order [46]. This has the schematic form (up to NLL): 

Nf(k 2 ,Ql)=Nf I (Ql) + a s (k 2 ) J dk' 2 [lC (k 2 , k'\ Q§) +a s (fc 3 )£i (k 2 ,k' 2 ,Q 2 )\ f(k' 2 ), (5) 

where //(Qg) * s a non-perturbative initial condition at some initial scale Qq, as = 3as/ir and K.0,1 are the 
^Contributing authors: G. Altarelli, R. D. Ball, M. Ciafaloni, D. Colferai, G. P. Salam, A. Stasto, R. S. Thorne, C. D. White 



LL and NLL BFKL kernels. Different choices for the argument of the running coupling are possible, leading 
to accordingly modified JC\ [47,48]. 

The solution of the BFKL equation can be used to extract leading and subleading singular contributions 
to singlet DGLAP splitting functions. The BFKL equation can either be solved numerically in its form given 
by Eq. ©, or else analytically by performing a double Mellin transform with respect to x and k 2 : 



whereby the BFKL equation becomes a differential equation, with kernels xo.i (7) defined respectively as 
the Mellin transforms of £0,1 • Furthermore, by using the /^-factorisation theorem [49], one may determine 
leading small x contributions to all orders to hard partonic cross sections for physical processes such as 
heavy quark electroproduction [49] and deep-inelastic scattering [50]. Approximate subleading results are 
also available [51,52]. 

These results for splitting functions and hard partonic cross sections can then be combined with fixed- 
order results to obtain resummed predictions for physical observables. However, it has now been known for 
some time that the LL BFKL equation is unable to describe scattering data well, even when matched to a 
fixed order expansion. Any viable resummation procedure must then, at the very least, satisfy the following 
requirements: 

1. Include a stable solution to the BFKL equation with running coupling up to NLL order. 

2. Match to the standard DGLAP description at moderate and high x values (where this is known to 
describe data well). 

3. Provide the complete set of splitting and coefficient functions for Fi and Fl in a well defined factorisa- 



Over the past few years, three approaches have emerged which, to some extent, aim at fulfilling these 
conditions. Here we call these the ABF [53-60], CCSS [48, 61-67] and TW [68-73] approaches. In the 
ABF scheme all three requirements are met, and resummed splitting functions in the singlet sector have been 
determined. Furthermore, a complete control of the scheme dependence at the resummed level has been 
achieved, thereby allowing for a consistent determination of resummed deep-inelastic coefficient functions, 
and thus of resummed structure functions. However, the results obtained thus have not been fit to the data yet. 
In the CCSS formalism, resummed splitting functions have also been determined. However, results are given 
in a scheme which differs from the MS scheme at the resummed level; furthermore, resummed coefficient 
functions and physical observables haven't been constructed yet. The TW approach, instead, has already been 
compared to the data in a global fit. However, this approach makes a number of simplifying assumptions and 
the ensuing resummation is thus not as complete as that which obtains in other approaches: for example, this 
approach does not include the full collinear resummation of the BFKL kernel. 

A comparison of resummed splitting functions and solution of evolution equations determined in the 
ABF and CCSS approaches with rif = was presented in Ref. [1]; the main features and differences of these 
approaches were also discussed. Here, we extend this comparison to the case of nj / resummation, and also 
to the TW approach. First, we will briefly summarize the main features of each approach, and in particular we 
display the matrix of splitting functions determined in the ABF and CCSS approaches. Then, we will compare 
iT-factors for physical observables determined using the ABF and TW approach. 

Note that there are some difference in notations between various groups, which are retained here in 
order to simplify comparison to the original literature. In particular, the variable N in Eq. © will be referred 
to as uj in the CCS approach of Section l2.2.2l and the variable 7 in the same equation will be referred to as M 
in the ABF approach of Section 12.2.1 1 

2.2. 1 The Altarelli-Ball-Forte (ABF) Approach 

In the ABF approach [53-60, 74-77] one concentrates on the problem of obtaining an improved anomalous 
dimension (splitting function) for DIS which reduces to the ordinary perturbative result at large ./V (large x), 




(6) 



tion scheme. 



thereby automatically satisfying renormalization group constraints, while including resummed BFKL correc- 
tions at small iV (small x), determined through the renormalization-group improved (i.e. running coupling) 
version of the BFKL kernel. The ordinary perturbative result for the singlet anomalous dimension is given by: 

j(N,a s ) =a s7o (iV) + a 2 s7l (N) + a 3 s j 2 (N) .... (7) 

The BFKL corrections at small N (small x) are determined by the BFKL kernel x(M, a s ): 

X (M, a s ) = a sX0 (M) + a 2 sX i(M) + . . . , (8) 

which is the Mellin transform, with respect to t = In of the N — > angular averaged BFKL kernel. 
The ABF construction is based on three ingredients. 

1 . The duality relation between the kernels x an d 7 

x(-y(N,a a ),a a ) = N, (9) 

which is a consequence of the fact that at fixed coupling the solutions of the BFKL and DGLAP equa- 
tions should coincide at leading twist [53,74,78]. By using duality, one can use the perturbative ex- 
pansions of 7 and x m powers of a s to improve (resum) each other: by combining them, one obtains a 
"double leading" (DL) expansion which includes all leading (and subleading, at NLO) logs of x and Q 2 . 
In particular, the DL expansion automatically resums the collinear poles of x at M = 0. This eliminates 
the alternating sign poles +1/M, — 1/M 2 , that appear in xo> ■ ■ > an d make the perturbative ex- 
pansion of x unreliable. This result is a model independent consequence of momentum conservation 
7(1, a s ) = 0, whence, by duality: 

X (0,a s ) = l. (10) 

2. The symmetry of the BFKL kernel upon gluon interchange. In Mellin space, this symmetry implies 
that at the fixed-coupling level the kernel x f° r evolution in In must satisfy xiM) = x(l ~~ M). 
By exploiting this symmetry, one can use the collinear resummation of the region M ~ which was 
obtained using the double-leading expansion to also improve the BFKL kernel in the anti-collinear 
M ~ 1 region. This leads to a symmetric kernel which is an entire function for all M, and has a 
minimum at M = ~ . The symmetry is broken by the DIS choice of variables In ~ = In and by the 
running of the coupling; however these symmetry breaking contribution can be determined exactly. This 
then leads to a stable resummed expansion of the resummed anomalous dimension at the fixed coupling 
level. 

3. The running-coupling resummation of the BFKL solution. Whereas running coupling corrections to 
evolution equations are automatically included when solving the DGLAP evolution equation with re- 
summed anomalous dimensions, the duality relation Eq. Q itself undergoes corrections when the run- 
ning coupling is included in the BFKL equation Q. Running coupling corrections can then be derived 
order by order, and turn out to be affected by singularities in Mellin M space. This implies that after 
Mellin inversion the associate splitting functions is enhanced as x — > 0: their contribution grows as 
(a s f3o In i) n with the perturbative order. However the series of leading enhanced contribution can be 
summed at all orders in closed form, because it corresponds to the asymptotic expansion in powers of a s 
of the solution to the running coupling BFKL equation © when the kernel x is approximated quadrati- 
cally about its minimum. This exact solution can be expressed in terms of Airy functions [54, 79] when 
the kernel is linear in a s and in terms of Bateman [56] functions for generic kernels. Because both the 
exact solution and its asymptotic expansion are known, this BFKL running coupling resummation can 
be combined with the DGLAP anomalous dimension, already resummed at the BFKL fixed coupling 
level, with full control of overlap (double counting terms). Schematically, the result has the following 
form: 

7h C NLO^ s (t),N) = 7^ r o t («s(t),iV) + 7 B («s(t),iV) -if (a s (t),N) - 7 *(a s (t) , N) 
-jf St0 (a s (t),N)+ lraatch (a s (t),N)+ lmom (a s (t),N), (11) 




Fig. 4: The resummed splittings functions P qq , P qg , P gq and P gg in the ABF approach, all for n/ = 4 and q s = 0.2: LO DGLAP 
(dashed black), NLO DGLAP (solid black), NNLO DGLAP (solid green), LO resummed (red dashed), NLO resummed in the Q MS 
scheme (red) and in the MS scheme (blue). 



where ^^^ {a s {t), N) contains all terms which are up to NLO in the double-leading expansion of 
point 1, symmetrized as discussed in point 2 above so that its dual \ has a minimum; j B (a s (t), N) 
resums the series of singular running coupling corrections using the aforementioned exact BFKL solu- 
tion in terms of a Bateman function; j^(a s (t), N), j^ s (a s (t),N) 7^ (a s (t), N) are double counting 
subtractions between the previous two contributions; 7 mom subtracts subleading terms which spoil exact 
momentum conservation; 7 ma t c h subtracts any contribution which deviates from NLO DGLAP and at 
large N doesn't drop at least as jj. 

The anomalous dimension obtained through this procedure has a simple pole as a leading small- A r (i.e. 
small x) singularity, like the LO DGLAP anomalous dimension. The location of the pole is to the right of the 
DGLAP pole, and it depends on the value of a s . Thanks to the softening due to running of the coupling, this 
value is however rather smaller than that which corresponds to the leading BFKL singularity: for example, for 
a s = 0.2, when rif = the pole is at N = 0.17. 

The splitting function obtained by Mellin inversion of the anomalous dimension eq. (TTTT t turns out to 
agree at the percent level to that obtained by the CCSS group by numerical resolution of the BFKL equation 
for all x < 10~ 2 ; for larger values of x (i.e. in the matching region) the ABF result is closer to the NLO 
DGLAP result. 

In order to obtain a full resummation of physical observables, specifically for deep-inelastic scattering, 
the resummation discussed so far has to be extended to the quark sector and to hard partonic coefficients. This, 
on top of various technical complications, requires two main conceptual steps: 

• A factorization scheme must be denned at a resummed level. Because only one of the two eigenvectors 




Fig. 5: The resummed DIS coefficient functions C29, C29, Ch q and Cl 9 in the ABF approach, all for n/ = 4 and a s = 0.2. The 
curves are labelled as in the previous figure. 



of the matrix of anomalous dimensions is affected by resummation, once a scheme is chosen, the resum- 
mation discussed above determines entirely the two-by-two matrix of splitting functions in the singlet 
sector. The only important requirement is that the relation of this small x scheme choice to standard 
large x schemes be known exactly, since this enables one to combine resummed results with known 
fixed order results. 

• PDFs evolved using resummed evolution equations must be combined with resummed coefficient func- 
tions. These are known, specifically for DIS [50], but are also known [80] to be affected by singularities, 
analogous to the running coupling singularities of the resummed anomalous dimension discussed above, 
which likewise must be resummed to all orders [58]. This running coupling resummation of the coef- 
ficient function significantly softens the small x growth of the coefficient function and substantially 
reduces its scheme dependence [59]. 

These steps have been accomplished in Ref. [59], where resummed anomalous dimensions (see fig-H]), 
coefficient functions (see figj5]) and structure functions (see section l?.2.4l below) have been determined. The 
scheme dependence of these results can be studied in detail: results have been produced and compared in 
both the MS and QoMS schemes, and furthermore the variation of results upon variation of factorization and 
renormalization scales has been studied. 

Calculations of resummation corrections not only of deep inelastic processes, but also of benchmark 
hadronic processes such as Drell-Yan, vector boson, heavy quark and Higgs production are now possible and 
should be explored. 



2.2.2 The Ciafaloni-Colferai-Salam-Stasto (CCSS) Approach 

The Ciafaloni-Colferai-Salam-Stasto (CCSS) resummation approach proposed in a series a papers [48,61-67] 
is based on the few general principles: 

• We impose the so-called kinematical constraint [81-83] onto the real gluon emission terms in the BFKL 
kernel. The effect of this constraint is to cut out the regions of the phase space for which k!^ > k\jz 
where hp, k' T are the transverse momenta of the exchanged gluons and z is the fraction of the longitu- 
dinal momentum. 

• The matching with the DGLAP anomalous dimension is done up to the next-to-leading order. 

• We impose the momentum sum rule onto the resummed anomalous dimensions. 

• Running coupling is included with the appropriate choice of scale. We take the argument of the running 
coupling to be the transverse momentum squared of the emitted gluon in the BFKL ladder in the BFKL 
part. For the part which multiplies the DGLAP terms in the eigenvalue equation we choose the scale to 
be the maximal between k\ and k^. 

• All the calculations are performed directly in momentum space. This in particular enables easy imple- 
mentation of the running of the coupling with the choice of the arguments as described above. 

The implementation at the leading logarithmic level in BFKL and DGLAP (and in the single gluon 
channel case) works as follows. It is convenient to go to the Mellin space representation where we denote by 
7 and oj the Mellin variables conjugated to Inkp and In 1/x respectively. The full evolution kernel can be 
represented as a series K, = J2 n a ™ +1 ^-n(7> We take the resummed kernel at the lowest order level to be 

Ko(7,w) = — fr) + [7o S » " — ]X?(7) ■ (12) 

UJ UJ 

The terms in (fT2t are the following 

X £( 7 ) = 2V(1) - V(7) " ^(1 " 7 + w) , 

is the leading logarithmic BFKL kernel eigenvalue with the kinematical constraint imposed. This is reflected 
by the fact that the singularities in the 7 plane at 7 = 1 are shifted by the oj. This ensures the compatibility 
with the DGLAP collinear poles, in the sense that we have only single poles in 7. The function Xc{l) is the 
collinear part of the kernel 

Xg(7) = -+ , \ , 
7 1 — 7 + OJ 

which includes only the leading collinear poles at 7 = or 1. All the higher twist poles are neglected for 
this part of the kernel. This kernel eigenvalue is multiplied by the non-singular (in oj) part of the DGLAP 
anomalous dimension ^q 9 (oj) — 2Ca/u where 7q 9 (w) is the full anomalous dimension at the leading order. 
The next-to-leading parts both in BFKL and DGLAP are included in the second term in the expansion, i.e. 
kernel K\ 

/C l(7 , w ) = ^^xi(7) + 7 f (^)XcH (13) 



where X\{l) is the NLL in x part of the BFKL kernel eigenvalue with subtractions. These subtractions are 
necessary to avoid double counting: we need to subtract the double and triple collinear poles in 7 which are 
already included in the resummed expression (fT2b and which can be easily identified by expanding this expres- 
sion in powers of oj and using the LO relation oj = a s xo{l)- The term ^(oj) in Eq. ( [TBI is chosen so that one 
obtains the correct DGLAP anomalous dimension at a fixed next-to-leading logarithmic level. The formalism 
described above has been proven to work successfully in the single channel case, that is for evolution of gluons 
only. The solution was shown to be very stable with respect to the changes of the resummation scheme. 

The quarks are included in the CCSS approach by a matrix formalism. The basic assumptions in this 
construction are: 

• Consistency with the collinear matrix factorization of the PDFs in the singlet evolution. 
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Fig. 6: Gluon-induced part of the Green function for the NLx-NLO and NLx-NL0 + models, compared to the results the single 
channel approach. For the models of this paper both gluon-gluon and quark-gluon Green 's function are shown. The value chosen for 
the coupling, a s = 0.15, corresponds to ko — 20GeV. The band indicates the spread in the result for the NLx-NLO model when 
varying the renormalization scale in the range 0.5 < < 2. 



• Requirement that only single pole singularities in both in 7 and u> are present in the kernel eigenvalues. 
This assumption allows for the natural consistency with DGLAP and BFKL respectively. Higher order 
singularities can be generated at higher orders only through the subleading dependencies on these two 
variables. 

• Ability to compute all the anomalous dimensions which can be directly compared with the DGLAP 
approach. This can be done by using set of recursive equations which allow to calculate the anomalous 
dimensions order by order from the kernel eigenvalues. 

• Impose the collinear-anticollinear symmetry of the kernel matrix via the similarity transformation. 

• Incorporate NLLx BFKL and DGLAP up to NLO (and possibly NNLO). 

The direct solutions to the matrix equations are the quark and gluon Green's functions. These are 
presented in Fig. [6] for the case of the gluon-gluon and quark-gluon part. The resulting gluon-gluon part is 
increasing exponentially with the logarithm of energy Ins with an effective intercept of about ~ 0.25. It is 
much suppressed with respect to the leading logarithmic order. We also note that the single channel results and 
the matrix results for the gluon-gluon Green's function are very similar to each other. In Fig.|6]we also present 
the quark-gluon channel which is naturally suppressed in normalization with respect to the gluon-gluon one 
by a factor of the strong coupling constant. This can be intuitively understood as the (singlet) quarks are 
radiatively generated from the gluons, and therefore this component follows the gluon density very closely. 
The yellow bands indicate the change of the Green's functions with respect to the change of the scale. 

In Fig. |7] we present all four splitting functions for fixed value of scale Q 2 . Here, again the results 
are very close to the previous single channel approach in the case of the gluon-gluon splitting function. The 
gluon-quark channel is very close to the gluon-gluon one, with the characteristic dip of this function at about 
x ~ 10~ 3 . The dip delays the onset of rise of the splitting function only to values of x of about 10 -4 . The 
scale dependence growths with decreasing x but it is not larger than in the fixed NLO case. The quark-gluon 
and quark-quark splitting functions tend to have slightly larger uncertainty due to the scale change but are also 
slightly closer to the plain NLO calculation. They also tend to have a less pronounced dip structure. 



10° 10° KT 10"° 10 



0.04 
0.03 



°- 0.02 

X 



- 1 1 — r| 1 1 — r| 1 1 — r| 1 1 — r| 1 1 — rp- 

NLx-NLO 



NLx-NLO + 

NLO 



0.10 

Q- 

x 




1 10~ 6 10~ 5 10~ 4 10~ 3 10~ 2 10" 1 1 



Fig. 7: The matrix of NLx-NLO (and NLx-NLO + ) splitting functions together with their scale uncertainty and the NLO splitting 
functions for comparison. In the gg channel, we also show the old scheme B result (nj = 0, no NLO contributions, 1-loop coupling) 
. The band corresponds to the span of results (NLx-NLO) obtained if one chooses x M = 0.5 and x^ = 2.0. 



2.2.3 The Thome-White (TW) Approach 

Substituting the LO running coupling as(k 2 ) into equation © and performing a double Mellin transform 
according to equation ©, the BFKL equation [51 as mentioned in Section I2T21 becomes a differential equation: 



d 2 f( 7 ,N) _ d 2 f!( 7 ,Q 2 Q ) 1 d( X oh)fh,N)) 7T 
d-f 2 dj 2 f3 N dj 3$N 



+ ^277Xi(l)f(l,N), (14) 



where Xo,i(t) w& the Mellin transforms of /Co,i- The solution for f(N, 7) of Eq. (fT4l) has the following 
form [62, 84]: 

/(Ar , 7) . exp (_^))/;, Wexp (^m)^ 

Up to power-suppressed corrections, one may shift the lower limit of the integral 7 — > 0, so that the gluon 
distribution factorises into the product of a perturbative and a non-perturbative piece. The nonperturbative 
piece depends on the bare input gluon distribution and an in principle calculable hard contribution. However, 
this latter part is rendered ambiguous by diffusion into the infrared, and in this approach is contaminated by in- 
frared renormalon-type contributions. The perturbative piece is safe from this and is sensitive to diffusion into 
the ultraviolet region of weaker coupling. Substituting equation (IT5b into (fT4l) . one finds that the perturbative 
piece is given (after transforming back to momentum space): 



1 /■1/2+ico ff3 

GkN,t) = — ^exp[ 1 t-X l ( 1 ,N)/(p N)]d 7 , (16) 

2w» Vl/2-toc 7 
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Fig. 8: Gluons arising from a global fit to scattering data including NLL small x resummations in the DIS(x) factorisation scheme 
(solid). Also shown is the result from an NLO DGLAP fit in the same scheme. 



where: 



Xo(t) 



dj. (17) 



Structure functions Fj also factorize, and the perturbative factors have a similar form to Eq. (U6l) . but involve an 
additional impact factor ^(7, N) in the integrand according to the /c t -factorisation theorem [50]. Crucially, 
coefficient functions and anomalous dimensions involve ratios of the above quantities, such that the non- 
perturbative factor cancels. Thus, once all the impact factors are known, the complete set of coefficient and 
splitting functions can be disentangled. Finally they can be combined with the standard NLO DGLAP results 
(which are known to describe data well at higher x values) using the simple prescription: 



ptot. _ pNLL , pNLO 



pNLL(0) , pNLL{\) 



(18) 



where P is a splitting or coefficient function, and p NLL (' t ) the 0{a\) contribution to the resummed result 
which is subtracted to avoid double-counting. It should be noted that the method of subtraction of the re- 
summed contribution in the matching is different to that for the ABF approach outlined after Eq. (fTTT t. For 
example, at NLO in the resummation the BFKL equation provides both the a$/N part of P gg and the part at 
0(as) constant as N — > 00. Hence we choose to keep all terms constant as N — > 00 generated by Eq. (fT6l ). 
with similar considerations for other splitting functions and coefficient functions, though these can contain 
terms oc N. Hence, we include terms which will have some influence out to much higher x than in the ABF 
approach. 

In the TW manner of counting orders LL is defined as the first order at which contributions appear, 
so while for the gluon splitting function this is for agln m (l/a;) for m = n — 1 for impact factors this is 
for m = n — 2. A potential problem therefore arises in that the NLL impact factors are not known exactly. 
However, the LL impact factors with conservation of energy of the gluon imposed are known in cases of both 
massless and massive quarks [51,52], and are known to provide a very good approximation to the full 
and 0(a 3 s ) quark-gluon splitting functions and coefficient functions [85], implying that they must contain 
much of the important higher-order information. These can then be used to calculate NLL coefficient and 
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Fig. 9: The resummed splitting functions (solid) 
to the corresponding NLO forms (dotted). 



P gg and P qg in the TW approach, both for nf = 4 and as = 0.16, compared 



splitting functions within a particular factorisation scheme. One must also specify a general mass variable 
number scheme for consistent implementation of heavy quark mass effects. Such a scheme (called the DIS(x) 
scheme) has been given in [72, 73] up to NLL order in the high energy expansion, and NLO order in the fixed 
order expansion. 

The form of the resummed splitting functions shown in fig. [9] are qualitatively consistent with those 
from the ABF approach, fig. HJ and CCSS approach fig. [7] (note however that in these plots the value of a s 
is a little larger, and the scheme is different). This is despite the fact that the approach does not include the 
explicit collinear resummation of the BFKL kernel adopted in the other two approaches. It was maintained in 
[70,71] that the diffusion into the ultraviolet, effectively making the coupling weaker, hastens the perturbative 
convergence for splitting functions, and the kernel near 7 = 0, making this additional resummation less 
necessary. There is no particular obstruction to including this resummation in the approach, it is simply 
cumbersome. Indeed, in Ref. [71] the effect was checked, and modifications found to be no greater than 
generic NNLO corrections to the resummation, so it was omitted. (Note that any process where there are two 
hard scales, sensitive to 7 0.5, or attempted calculation of the hard input for the gluon distribution, sensitive 
to 7 = 1, would find this resummation essential.) The main feature of the resummed splitting functions is 
a significant dip below the NLO DGLAP results, followed by an eventual rise at very low x ~ 10~ 5 . This 
behaviour drives a qualitative change in the gluon distribution, when implemented in a fit to data. 

The combined NLO+NLL splitting and coefficient functions (in the TW approach) have been imple- 
mented in a global fit to DIS and related data in the DIS(x) scheme, thus including small x resummations in 
both the massless and massive quark sectors [73]. The overall fit quality was better than a standard NLO fit 
in the same factorisation scheme, and a similar NLO fit in the more conventional MS factorisation scheme. 
The principal reason for this is the dip in the resummed evolution kernels, which allows the gluon distribution 
to increase at both high and low values of x. This reduces a tension that exists between the high x jet data 
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Fig. 10: Recent HI data on the longitudinal structure function Fl, together with the NLL resummed prediction from the TW approach, 
and a recent NNLO result from the MSTW group. 



of [86, 87] and the low x HERA data [18, 88-91]. The gluon distributions arising from the NLL and NLO 
fits are shown in figure [U for the starting scale Q 2 = lGeV 2 and also for a higher value of Q 2 . One sees 
that whilst the NLO gluon wants to be negative at low x and Q 2 , the resummed gluon is positive definite and 
indeed growing slightly as x — ► 0. The gluons agree well for higher x values (where the DGLAP description 
is expected to dominate), but deviate for x < 10~ 2 . This can therefore be thought of as the value of x below 
which resummation starts to become relevant. 

The qualitatively different gluon from the resummed fit (together with the decreased evolution kernels 
w.r.t. the fixed order description) has a number of phenomenological implications: 

1 . The longitudinal structure function Fl is sensible at small x and Q 2 values, where the standard DGLAP 
description shows a marked instability [92]. 

2. As a result of the predicted growth of Fl at small x the resummed result for the DIS reduced cross- 
section shows a turnover at high inelasticity y, in agreement with the HERA data. This behaviour is not 
correctly predicted by some fixed order fits. 

3. The heavy flavour contribution (from charm and bottom) to Fi is reduced at higher Q 2 in the resummed 
approach, due mainly to the decreased evolution, as already noted in a full analysis in the fixed-order 
expansion at NNLO [93]. Nevertheless, it remains a significant fraction of the total structure function at 
small x. 

Other resummation approaches should see similar results when confronted with data, given the quali- 
tative (and indeed quantitative) similarities between the splitting functions. It is the decreased evolution with 
respect to the DGLAP description that drives the qualitative change in the gluon distribution. This is then the 
source of any quantitative improvement in the description of data, and also the enhanced description of the 
longitudinal structure function and reduced cross-section. 

The resummed prediction for Fl is shown alongside the recent HI data [94] in figure [lOl and compared 
with an up-to-date NNLO fixed order result [95]. One sees that the data cannot yet tell apart the predictions, 
but that they are starting to diverge at low x and Q 2 , such that data in this range may indeed be sensitive to the 
differences between resummed and fixed order approaches. 
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Fig. 1 1 : The ratio Fi fLL / Fi fLO in the ABF approach (left) and the TW approach (right), using toy PDFs, given in eq. [20] calculated 
as function of x at fixed for Q 2 (upper ), and as a function of Q 2 at fixed x (lower). 



2.2.4 Resummed structure functions: comparison of the ABF and TW approaches 

In this section, we present an application of the ABF and TW approaches to the resummed determination 
of the i<2 and Fl deep-inelastic structure functions. The corresponding exercise for the CCSS approach has 
not yet been finalised. A direct comparison of the two approaches is complicated by issues of factorisation 
scheme dependence: whereas in the ABF approach results may be obtained in any scheme, and in particular 
the MS and closely related Qo-MS scheme, in the TW formalism splitting functions and coefficient functions 
beyond NLO in as are resummed in the Qo-DIS scheme [66,96], which coincides with the standard DIS 
scheme at large x but differs from it at the resummed level; the scheme change needed in order to obtain the 
coefficient functions from the DIS -scheme ones is performed exactly up to NLO and approximately beyond 
it. Thus, without a more precise definition of the relation of this scheme to MS, one cannot compare splitting 
and coefficient functions, which are factorisation scheme dependent. 

A useful compromise is to present the respective results for the ratio of structure function predictions: 

F^ LL (x O 2 ) 

K - 1 { ' ^ ' (191 

Kt ~ Fi* L °( x ,Qiy 

where i £ 2, L, and the Fi are calculated by convoluting the relevant coefficients with PDFs obtained by 
perturbative evolution of a common set of of partons, defined at a starting scale of Q\ = 4GeV 2 . The number 
of flavors is fixed to three, to avoid ambiguities due to heavy quark effects. The initial PDFs are assumed to 
be fixed (i.e., the same at the unresummed and unresummed level) in the DIS factorization scheme at the scale 
Qq. Of course, in a realistic situation the data are fixed and the PDFs are determined by a fit to the data: hence 
they are not the same at the resummed and unresummed level (compare Fig. [8] above). However, in the DIS 
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Fig. 12: The ratio F^ LL jFl LO in the ABF approach (left) and the TW approach (right), using toy PDFs, given in eq. 1201 calculated 
as function of x at fixed for Q 2 (upper ), and as a function of Q 2 at fixed x (lower). 

factorization scheme the structure function F2 is simply proportional to the quark distribution, hence by fixing 
the PDFs in this scheme one ensures that F2 is fixed at the starting scale. 

This starting PDFs are constructed as follows: the quark and gluon distributions are chosen to have the 
representative form also used in Ref. [59] 

xg(x) = k s xS{x) = k g x~ 0A8 (l - x) 5 ; xq v = k q x°- 5 (l - xf, (20) 

in the MS scheme, where g(x) is the gluon, S(x) the sea quark distribution, and xq v (x) denotes a valence 
quark distribution. We choose k s = 3, and then all other parameters are fixed by momentum and number sum 
rules. Note that the gluon is the same as that used in the previous comparison of Ref. [1]. The PDFs eq. (l20l 
are then transformed to the DIS factorization scheme [97] using the NLO (unresummed) scheme change at 
the scale Qq. The result is then used as a fixed boundary condition for all (unresummed and resummed, ABF 
and TW) calculations. In the TW approach, the DIS scheme for unresummed quantities and QoDIS scheme 
as discussed above is then used throughout. In the ABF approach, the fixed DIS-scheme boundary condition 
is transformed to the QoMS scheme [59, 98] (which at the unresummed level coincides with standard MS) 
by using the unresummed or resummed scheme change function as appropriate, and then all calculations are 
performed in QoMS. One might hope that most of the residual scheme dependence cancels upon taking 
the ratio of the NLL and NLO results, at least for schemes that are well defined and without unphysical 
singularities. 

The results for K2 and Kl are shown in figures [TT] for F2 in the ABF and TW procedures respectively 
and similarly in figures[T2]for Fl. One sees that for x sufficiently small, and for Q not too large, the resummed 
F2 is consistently lower than its fixed order counterpart in both approaches, due to the decreased evolution of 
the gluon, and also (in the MS scheme) due to the fact that resummed coefficient functions are much larger 



than the NLO ones at small x and low Q 2 . Similarly the resummed Fl is larger than the fixed order at low Q 
and small enough x, but falls rapidly as Q increases. However despite these superficial similarities, the two 
approaches differ quantitatively in several respects: 

• the ABF resummed F2 matches well to the NLO for x <; 10 -2 at all scales, while the TW F2 shows a 
rise around x ~ 10 -2 , which is largest at low Q. This may be due to the significant differences between 
resummed and NLO splitting functions at very high x in fig. [9] A similar mismatch may be seen at 
x ~ 0.1 in the Fl K-f actor. 

• at large scales the ABF resummation stabilises, due to the running of the coupling, so the K-factors 
becomes rather flat: they grow only logarithmically in In Q. By contrast the TW F2 K-factor still shows 
a marked Q 2 dependence. This may be related to the fact that the TW resummation does not resum 
the collinear singularities in the BFKL kernel, and to the TW choice (see Sect. 12.2.31 ) not to include 
subtraction of terms induced by the resummation which do not drop at large x. This choice induces a 
change in the PDFs at higher x in the TW approach, which results in effects which persist to higher Q 2 
at smaller x. 

• at the initial scale Qq the TW resummed Fl grows much more strongly as x decreases than the ABF 
resummed Fl. This is likely to be due to the different treatment of the coefficient functions: in this 
respect, the fully consistent treatment of the factorization scheme, the effect of collinear resummation, 
and the different definitions of what is called resummed NLO used by the two groups all play a part. 

2.2.5 Conclusion 

The problem of understanding the small x evolution of structure functions in the domain of x and Q 2 values 
of relevance for HERA and LHC physics has by now reached a status where all relevant physical ingredients 
have been identified, even though not all groups have quite reached the stage at which the formalism can be 
transformed into a practical tool for a direct connection with the data. 

In this report we summarised the status of the three independent approaches to this problem by ABF, 
CCSS and TW, we discussed the differences in the adopted procedures and finally we gave some recent results. 
The most complete formalisms are those by ABF and CCSS while the TW approach is less comprehensive 
but simpler to handle, and thus has been used in fit to data. We recall that, at the level of splitting functions 
the ABF and CCSS have been compared in ref. [1] and found to be in very good agreement. The singlet 
splitting function obtained by TW was also compared with ABF and CCSS in ref. [73] and also found to be in 
reasonable agreement, at least at small x. 

Here we have shown the results of an application to the structure functions F2 and Fl of the ABF and 
TW methods. The same input partem densities at the starting scale Qq were adopted by these two groups 
and the /^-factors for resummed versus fixed NLO perturbative structure functions were calculated using the 
respective methods. The results obtained are in reasonable qualitative agreement for F2, less so for Fl. Dis- 
crepancies may in part be due to the choice of factorization scheme, but our study suggests that the following 
are also likely to make a quantitative difference: whether or not a resummation of collinear singularities in 
the BFKL kernel is performed, whether contributions from the resummation which persist at large x are sub- 
tracted and whether the factorization scheme is consistently defined in the same way at resummed and NLO 
levels. 

2.3 Parton saturation and geometric scaling 22 

2.3.1 Introduction 23 

The degrees of freedom involved in hadronic collisions at sufficiently high energy are partons, whose density 
grows as the energy increases (i.e., when x, their momentum fraction, decreases). This growth of the number 
of gluons in the hadronic wave functions is a phenomenon which has been well established at HERA. One 
expects however that it should eventually "saturate" when non linear QCD effects start to play a role. 

22 Contributing authors: G. Beuf, F. Caola, F. Gelis, L. Motyka, C. Royon, D. Salek, A. M. Stasto 
23 Contributing authors: F. Gelis, A. M. Stasto 
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An important feature of partonic interactions is that they involve only partons with comparable ra- 
pidities. Consider the interaction between a hadron and some external probe (e.g. a virtual photon in Deep 
Inelastic Scattering) and consider what happens when one boosts the hadron, increasing its rapidity in succes- 
sive steps. In the first step, the valence constituents become Lorentz contracted in the longitudinal direction 
while the time scale of their internal motions is Lorentz dilated. In addition, the boost reveals new vacuum 
fluctuations coupled to the boosted valence partons. Such fluctuations are not Lorentz contracted in the longi- 
tudinal direction, and represent the dynamical degrees of freedom; they are the partons that can interact with 
the probe. Making an additional step in rapidity would freeze these fluctuations, while making them Lorentz 
contracted as well. But the additional boost also produces new quantum fluctuations, which become the new 
dynamical variables. This argument can be repeated, and one arrives at the picture of a high-energy projectile 
containing a large number of frozen, Lorentz contracted partons (the valence partons, plus all the quantum 
fluctuations produced in the previous boosts), and partons which have a small rapidity, are not Lorentz con- 
tracted and can interact with the probe. This space-time description was developed before the advent of QCD 
(see for instance [99]; in Bjorken's lectures [100], one can actually foresee the modern interpretation of partem 
evolution as a renormalization group evolution). 

This space-time picture, which was deduced from rather 
general considerations, can now be understood in terms of 
QCD. In fact, shortly after QCD was established as the theory of 
strong interaction, quantitative equations were established, de- 
scribing the phenomenon outlined above [42, 101-105]. In par- 
ticular - , the equation derived by Balitsky, Fadin, Kuraev and Li- 
patov [42, 101] describes the growth of the non-integrated gluon 
distribution in a hadron as it is boosted towards higher rapidi- 
ties. Experimentally, an important increase of the number of 
gluons at small x has indeed been observed in the DIS exper- 
iments performed at HERA (see Fig. IT3T >. down to x ~ 10~ 4 . 
Such a growth raises a problem: if it were to continue to arbi- 
trarily small x, it would induce an increase of hadronic cross- 
sections as a power of the center of mass energy, in violation of 
known unitarity bounds. 

However, as noticed by Gribov, Levin and Ryskin in 
[106], the BFKL equation includes only branching processes 
that increase the number of gluons (g — ► gg for instance), but 
not the recombination processes that could reduce the number of gluons (like gg — > g). While it may be legiti- 
mate to neglect the recombination process when the gluon density is small, this cannot remain so at arbitrarily 
high density: a saturation mechanism of some kind must set in. Treating the partons as ordinary particles, one 
can get a crude estimate of the onset of saturation, which occurs at: 
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Fig. 13: The gluon structure function in a proton mea- 
sured at HERA. 
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The momentum scale that characterizes this new regime, Q s , is called the saturation momentum [107]. Partons 
with transverse momentum Q > Q s are in a dilute regime; those with Q < Q s are in the saturated regime. 
The saturation momentum increases as the gluon density increases. This comes from an increase of the gluon 
structure function as x decreases. The increase of the density may also come from the coherent contributions 
of several nucleons in a nucleus. In large nuclei, one expects Q 2 S oc A 1 / 3 , where A is the number of nucleons 
in the nucleus. 

Note that at saturation, naive perturbation theory breaks down, even though a s (Q s ) may be small if 
Q s is large: the saturation regime is a regime of weak coupling, but large density. At saturation, the gluon 
occupation number is proportional to l/a s . In such conditions of large numbers of quanta, classical field 
approximations become relevant to describe the nuclear wave-functions. 

Once one enters the saturated regime, the evolution of the partem distributions can no longer be described 
by a linear equation such as the BFKL equation. The color glass condensate formalism (for a review, see 



[108]), which relies on the separation of the degrees of freedom in a high-energy hadron into frozen partons and 
dynamical fields, as discussed above, provides the non linear equations that allow us to follow the evolution 
of the partonic systems form the dilute regime to the dense, saturated, regime. For instance, the correlator 
tv(W (x±)U(yj_)) of two Wilson lines -which enters in the discussion of DIS- evolves according to the 
Balitsky-Kovchegov [109,110] equation: 

dtv(W(x ± )U(y ± )) x _ a s f (x±~y ± ) 2 



dln(l/x) 2tt 2 J z± ( x± - z ± ) 2 (y ± - z±_) 2 

N c tr(uHx ± )U(y ± )) x - tr (U^(x ± )U (z ± )) (U^z ± )U (y ± )) x 



(22) 



(This equation reduces to the BFKL equation in the low density limit.) 

The geometric scaling phenomenon was first introduced in the context of the dipole picture of the deep 
inelastic electron-proton scattering [111]. The process of the scattering of the virtual photon on a proton at 
very small values of x can be conveniently formulated in the dipole model. In this picture the photon fluctuates 
into the quark-antiquark pair (dipole) and subsequently interacts with the target. In the small x regimes these 
two processes factorize and they can be encoded into the dipole formula for the total j*p cross section 

^t,l(x,Q 2 ) = J d 2 v J dz\^ TyL (r,z,Q 2 )\ 2 a{x,r) (23) 

where ^t,l is the wave function for the photon and a is the dipole cross section, r is the dipole size and z is 
the light-cone fraction of the longitudinal momentum carried by the quark (or antiquark). The photon wave 
functions \& are known, the dipole cross section can be expressed in terms of the correlator of Wilson lines 
whose evolution is driven by Eq. d22l : 



a(x,r) = ^J d2x tr(l - U(X + l)rf(X - |)) . (24) 

Alternatively, it can be modeled or extracted from the data. In the GBW model it was assumed that the dipole 
cross section has a form 

a = a [1 - exp(-r 2 /i?o(^) 2 )] (25) 

where Rq(x) = (x/xq)~ x is a saturation radius (its inverse is usually called the saturation scale Q s (x)) and 
(To a normalisation constant. One of the key properties of the model was the dependence on the dipole size 
and the Bjorken x through only one combined variable r 2 Q 2 s {x). This fact, combined with the property of the 
dipole formula, allows to reformulate the total cross section as a function of Q 2 /Q 2 {x) only. This feature is 
known as the geometric scaling of the total j*p cross section. Initially postulated as a property of the GBW 
model, it was then shown that the experimental data do indeed exhibit the aforementioned regularity in a rather 
wide range of Q 2 and for small values of Bjorken x. 

Although it is a postulate in the GBW model, this property can be derived from the small-x behavior 
of the solutions of Eq. (l22l [112] : for a wide class of initial conditions, the BK equation drives its solution 
towards a function that obeys this scaling. Note also that the saturation scale, introduced by hand in the GBW 
model, is dynamically generated by the non linear evolution described by Eq. (l22l . This suggested that the 
regularity seen in the data could be explained by the scaling property of the solutions to the nonlinear equations 
in the saturated regime - and thus may provide some indirect evidence for gluon saturation. 

Nevertheless, several important questions remained. One of them, is the problem of the compatibility 
of the DGLAP evolution with the property of the geometric scaling. It is known from the global fits that the 
standard DGLAP evolution works quite well for the description of the of the deep inelastic data even in the 
very low x and Q 2 regime. That suggests that the saturation should be confined to the very tight kinematic 
regime, and it is therefore questionable whether the observed regularity could be attributed to the saturation at 
all. In the present contribution we discuss several approaches to this problem. 
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Fig. 14: Fi data: Scaling curve a = cr(r) for "Fixed Cou- 
pling". A Q 2 > 1 GeV 2 cut was applied to the data. 



Fig. 15: DVCS data: Quality factor normalised to 1 plotted 
against the parameter A. Star denotes the fit result for F2 
data. 



2.3.2 Phenomenology 

In order to compare the quality of different scaling laws, it is useful to use a quantity called quality factor 
(QF). It is also used to find the best parameters for a given scaling. In the following, this method is used to 
compare the scaling results for the proton structure function F2 and F$, the deeply virtual Compton scattering, 
the diffractive structure function, and the vector meson cross section data measured at HERA. 



Quality Factor Given a set of data points (Q 2 ,x,a = a(Q 2 ,x)) and a parametric scaling variable r = 
t(Q 2 , Y, A) (with Y = In \/x) we want to know whether the cross-section can be parametrised as a function 
of the variable r only. Since the function of r that describes the data is not known, the QF has to be defined 
independently of the form of that function. 

For a set of points (uj, Vi), where t^'s are ordered and normalised between and 1, we introduce QF 
as follows [113] 



QF(X) 



4^ (ui - Uj„i) 2 + e 2 



(26) 



where e is a small constant that prevents the sum from being infinite in case of two points have the same value 
of u. According to this definition, the contribution to the sum in (1261 is large when two successive points are 
close in u and far in v. Therefore, a set of points lying close to a unique curve is expected to have larger QF 
(smaller sum in (|26l >) compared to a situation where the points are more scattered. 

Since the cross-section in data differs by orders of magnitude and r is more or less linear in log{Q 2 ), 
we decided to take ui = Tj(A) and Vi = log{o~i). This ensures that low Q 2 data points contribute to the QF 
with a similar weight as higher Q 2 data points. 



Fits to F 2 and DVCS Data We choose to consider all available data from HI, ZEUS, NMC and E665 
experiments [18,89-91, 114-117] with Q 2 in the range [1; 150] GeV 2 and x < 0.01 25 . We exclude the data 
with x > 1CT 2 since they are dominated by the valence quark densities, and the formalism of saturation does 
not apply in this kinematical region. In the same way, the upper Q 2 cut is introduced while the lower Q 2 cut 
ensures that we stay away from the soft QCD domain. We will show in the following that the data points 
with Q 2 < 1 GeV 2 spoil the fit stability. Two kinds of fits to the scaling laws are performed, either in the 

24 Contributing authors: C. Royon, D. Salek 

25 The data in the last ZEUS paper include contributions for Fl and 2F3 but those can be neglected within the kinematical domain 
we consider. 




Fig. 16: F2 data: Comparison of the A parameter for Fi Fig. 17: F% parametrisation: Scaling curve a = o(r) for 
and F£ data for Q 2 > 3 GeV 2 . fixed coupling using the MRST 2004 NNLO parametrisation 

for A = 0.33 as obtained in the fit to experimental data. No 
scaling is observed for Q 2 > 3 GeV 2 . 

full mentioned Q 2 range, or in a tighter Q 2 range [3; 150] GeV 2 to ensure that we are in the domain where 
perturbative QCD applies. 

Figure [l4]shows the scaling plot for "Fixed Coupling" in the Q 2 range [1; 150] GeV 2 , which shows that 
the lowest Q 2 points in grey have a tendency to lead to worse scaling. The QF values are similar for the "Fixed 
Coupling", "Running Coupling I", and "Running Coupling Ilbis" — with a tendency to be slightly better for 
"Running Coupling Ilbis" — and worse for diffusive scaling [118]. 

The amount of the DVCS data [119, 120] measured by HI and ZEUS is smaller (34 points for HI 
and ZEUS requiring x < 0.01 as for F2 data), therefore the precision on the A parameter is weaker. The 
kinematic coverage of the DVCS data covers smaller region in x and Q 2 than F 2 : 4 < Q 2 < 25 GeV 2 and 
5 • 10~ 4 < x < 5 • 10~ 3 . The DVCS data lead to similar A values as in the F2 data (see Fig. IT5T). showing 
the consistency of the scalings. The values of the QF show a tendency to favour "Fixed Coupling", but all 
different scalings (even "Diffusive Scaling") lead to reasonable values of QF. 

Implications for Diffraction and Vector Mesons We used the values of the parameters obtained from the fit 
to F2 data to test the various scaling variables on the diffractive cross section and vector meson data [121-123]. 
We tested both the fixed (3 scaling behaviour in xp and the fixed xp scaling behaviour in (3. At fixed (3, we 
find a scaling behaviour up to (3 = 0.65. At fixed xp, the scaling behaviour of the diffractive cross section as 
a function of (3 and Q 2 is far less obvious. This is not a surprise, as not enough data is available in the genuine 
small (3 region. A sign of scaling is however observed for the xp = 0.03 bin. 

Concerning p, J/^S, and <fi production [124-126], we found a reasonable scaling behaviour for all 
tested scaling variables, with the hard scale Q 2 + My, borrowed from vector mesons wave function studies. 
Surprisingly, the best scaling is for all three vector mesons the "Diffusive scaling". 

Fits to F2 and F% in QCD Parametrisations First we test the scaling properties using experimental F| 
data. The requirements on the kinematical domain remain the same as in the case of F2 studies. The lower 
Q 2 > 3 GeV 2 cut also allows to remove eventual charm mass effects. We use the charm F% measurements 
from the HI and ZEUS experiments [127-130]. Only 25 data points lie in the desired kinematical region. 

Since the statistics in the data is low, the fit results are not precise. Nevertheless, they still lead to 
clear results that are comparable to F2 fits. The results are found similar between F2 and F% (see Fig. [T6b - 
All A parameters are similar for F2 and F% except for "Diffusive Scaling". As in the case of the F2 scaling 



analysis, "Fixed Coupling", "Running Coupling I" and "Running Coupling II" give similar values of QF, and 
"Diffusive Scaling" is disfavoured. 

The QCD parametrisations [131-133] of the structure function have been tested using CTEQ, MRST, 
GRV. The same Q 2 and x points as in the experimental data were taken into account. Parametrisations of F2 
are able to reproduce the scaling results seen in the experimental data. However, they are not successful in 
describing the scaling properties in case of F 2 C . Fig. [FT] shows the scaling curve of "Fixed Coupling" in the 
MRST NNLO 2004 parametrisation of F% where the value of A = 0.33 is imposed (as seen in the experimental 
data). The scaling curve is plotted with all the points used in the F2 study. Therefore the fact that there is not 
just a single scaling curve in i 7 ^ parametrisation is not in direct disagreement with the data — with 25 point 
only, the curves in parametrisation and data look similar. However the fit values of A are different. 

The CTEQ, MRST or GRV parametrisations are unable to reproduce the scaling properties in F| . It 
seems a sea-like intrinsic charm component like the one used in CTEQ 6.6 C4 helps to get results closer to a 
single scaling curve [134]. Scaling is not present at all in the MRST or GRV parametrisations at low Q 2 . 

2.3.3 Geometric scaling and evolution equations with saturation 26 

Let us now recall how scaling properties arise from saturation, as shown in [112], using methods and results 
from non-linear physics (see [135, 136] for alternative demonstrations). Our discussion, independent of the 
precise saturation formalism, is valid e.g. for the JIMWLK and BK equations (see [108] and references 
therein), at LL, NLL or even higher order in log(l/x). We will discuss separately the fixed and the running 
a s cases, as running coupling is the main effect which can modify the discussion. 

Saturation amounts to add a non-linear damping contribution to the BFKL evolution. One writes for- 
mally the evolution equation at LL for the dipole-proton cross section a (1231) 

dya(Y, L) = ax{— 9l)o'(Y, L) — non-linear terms in a(Y, L) , (27) 

where Y = log(l/x), L = — \og{r 2 K^ CD ) and x{l) i s the characteristic function of the BFKL kernel. 
The nonlinear damping ensures that, for any Y, a(Y, L) grows at most as a power of \L\ for L — > —00 (i.e. 
r — > +00). The color transparency property of the dipole cross section implies a(Y, L) oc e~ L for L — > +00. 
Using a double Laplace transform with partial waves e -i L + uY > the linear part of (T27T ) reduces to the BFKL 
dispersion relation u = ax(j), which gives the partial waves solutions e^^ L ~ ax ^ Y ^' y \ In the relevant 
interval < 7 < 1, the phase velocity A(7) = (*x{l)/l has one minimum, for the critical value 7 = 7 C ~ 0.63 
which is the solution of x{lc) = IcX'dc)- In the presence of saturation terms in the evolution equation, the 
wave with 7 = 7 C is selected dynamically. 

In order to understand the dynamics of the problem, let us consider an arbitrary initial condition, at some 
rapidity Y = Yq. With the definition 7 e jj(L,Y) = — 8l log(<7(Y, L)), j e ff(L,Yo) gives the exponential 
slope of the initial condition in the vicinity of L. That vicinity will then propagates for Y > Yq at a velocity 
A(7 e //(L, Y)) = ax{leff{L,Y))/^ e f f (L,Y). One finds easily that, if j e ff(L,Y ) is a growing function of 
L, the regions of smaller velocity will spread during the Y evolution, and invade the regions of larger velocity. 
Restricting ourselves to initial conditions verifying the saturation at L — > —00 and the color transparency 
at L — ► +00 as discussed previously, one obtains that j e jf(L,Yo) goes from at low L to 1 at large L. 
At intermediate L, j e ff(L, Yq) will cross the value j c , corresponding to the minimal velocity A c = A(7 C ). 
Hence, one conclude that, as Y grows, there is a larger and larger domain in L where j e jf(L, Y) = j c and 
thus A = A c . In that domain, one has <r(Y, L) oc e~ lc( - L ~ XcY \ and hence the geometric scaling <x(Y, L) = 
f(L-\ c Y) = f(-\og(r 2 Q 2 s (x))), with a saturation scale Q 2 (x) = e XcY h? QCD = x~ Xc A 2 q CD . One finds 

that the geometric scaling window is limited to L < X C Y + y ax"(lc)Y/2, and separated from the region 
still influenced by the initial condition by a cross-over driven by BFKL diffusion. So far, we discussed only 
scaling properties of the dipole cross section a. As explained in the introduction, they imply similar scaling 
properties of the virtual photon-proton cross section, with the replacement r 1— > 1/Q. 

The mechanism of wave selection explained above happens mainly in the linear regime 27 , i.e. for small 
26 Contributing author: G. Beuf 

27 We call linear (non-linear ) regime the (Y,L) domain where the explicit value of the non-linear terms in 127 1 is (is not) negligible 
compared to the value of the linear terms. 



a, or equivalently r smaller than Q 2 s (x). However, the geometric scaling property stays also valid in the non- 
linear regime, i.e. for r larger than Q 2 (x), which is reached after a large enough evolution in Y. The only, 
but decisive, role of saturation in the linear domain is to provide the following dynamical boundary condition 
in the IR to the linear BFKL evolution: when a is large, it should be quite flat {^ e ff(L) ~ 0). Indeed, one 
can simulate successfully the impact of saturation on the solution in the linear regime by studying the BFKL 
evolution in the presence of an absorptive wall [136], set at a ^-dependent and selfconsistently determined 
position near the saturation scale. 

At NLL and higher order level, the terms different from running coupling ones do not affect the previous 
discussion. They just change the kernel eigenvalues x(l) an d thus shift the selected parameters 7 C and A c . On 
the contrary, going from fixed to running coupling brings important changes. As the mechanism of spreading 
of smaller velocity regions of the solution towards larger velocity ones is local, one expect that it holds in the 
running coupling case. But it selects coupling-dependent velocity and shape of the front, the coupling itself 
being L-dependent. Hence, the picture is the following. We still have the formation of a specific traveling 
wave front solution, which progressively loses memory of its initial condition. However, the selected values of 
the velocity and shape of the front drift as the front propagate towards larger L (smaller r), due to asymptotic 
freedom. So far, this running coupling case has been solved analytically [112, 136] only at large L and large Y, 
keeping the relevant geometric scaling variable — log(r 2 Q 2 (x)) finite. One finds that the evolution is slower 

than in the fixed coupling case, as the large Y behavior of the saturation scale is now Q 2 s {x) ~ e^ VcY ^ b AQ CD , 
with b = (33 — 2Nf)/36 and v c = 2x{^ c )hc- In addition, the geometric scaling window is narrower: asymp- 
totically in Y, it is expected to hold only for 28 L < ^v c Yjb + (|£l|/4) (x"(7c)) 1/3 ^ 1/6 /( 2& 7cX(7c)) 1/6 - 
The convergence of the selected front towards this asymptotic solution seems rather slow, which may weaken 
its phenomenological relevance. The whole theoretical picture is nevertheless consistent with numerical sim- 
ulations [137, 138]. Both leads to a universal traveling wave front structure of the solution, implying scaling 
properties also subasymptotically. 

In order to do phenomenological studies, one can try to extrapolate to finite L and Y the scaling behavior 
found asymptotically. However, this extrapolation is not unique [139]. There is indeed an infinite family of 
scaling variables 



parameterized by 5, which are different from each other at finite L and Y but all converge to the same asymp- 
totic scaling previously mentioned. The parameter 5 seems quite unconstrained, both from the theory and from 
the DIS data, as shown in the phenomenological section of the present contribution. We considered as bench- 
mark points in that family two specific choices of 5. The choice 5 = 1/2 leads to the only scaling variable 
of the family which is a genuine geometric scaling variable, i.e. is equivalent to a scaling with r 2 Q 2 {x). It is 
named running coupling I in the phenomenological section. The choice 5 = 1 leads to the scaling variable 
obtained by substitution of the fixed coupling by the running coupling directly in the original fixed coupling 
geometric scaling variable. It is called running coupling II. 

Finally, one expects scaling properties in any case from evolution equations with saturation, both in 
the non-linear regime, and in a scaling window in the linear regime. In the linear regime, the solution still 
obey the linearized equation, and saturation play only the role of a dynamically generated boundary condition. 
Hence, geometric scaling there, although generated by saturation, is not a hint against the validity of PDF 
fits. However, geometric scaling occurs also in the non-linear regime, where the scaling function is no more a 
solution of the linear BFKL or DGLAP equations. 

2.3.4 DGLAP evolution and the saturation boundary conditions 29 

One of the issues that could be studied in the context of the geometric scaling versus DGLAP evolution is 
the possibility of the different boundary conditions for the DGLAP evolution equations. These boundary 
conditions would incorporate the saturation effects and posses the scaling property. Typically, in the standard 

28 £i ~ —2.34 is the rightmost zero of the Airy function. 
Contributing author: A. M. Stasto 




(28) 



approach, to obtain the solution to the linear DGLAP evolution equations, one imposes the initial conditions 
onto the parton densities at fixed value of Qq and then performs the evolution into the region of larger values 
of Q 2 . However, in the presence of saturation these might not be the correct boundary conditions for DGLAP 
equations. As mentioned earlier the saturation regime is specified by the critical line, the saturation scale 
Q s (x) which is a function of x Bjorken and its value increases as the Bjorken x decreases (or as we go to yet 
higher energies). In that case it seems legitimate to ask, what is the behavior of the DGLAP solutions when 
evolved from the saturation boundary Q 2 = Q 2 (x) rather then from the fixed scale Q 2 = Qq. To answer 
this question we imposed [140] the boundary condition for the gluon density at the saturation scale Q 2 = Q 2 
which possesses the scaling property namely ^xg(x, Q 2 = Q 2 (x)) = ^r°x~ x (in the fixed coupling case). 
The solution for the gluon density at small x (at fixed coupling) which can be derived from solving the DGLAP 
equations with this boundary is given by 



a s xg(x,Q 2 ) 
2vr Q 2 



a s 
2^ 



Q 2 



Q 2 (x) 



(a s /27r)79 9 (w )-l 



(29) 



where ^ gg is the gluon-gluon DGLAP anomalous dimension. This solution clearly has the geometrical scaling 
property as it is only a function of Q 2 /Q 2 (x). It is interesting to note that there exists a critical value of 
the exponent A of the saturation scale which determines the existence of scaling. For example in the double 
leading logarithmic approximation the scaling is present for rather large values of the exponent A > 4a s 7r/3 
whereas there is no scaling for smaller values of A. The formula shown above is however only approximate, 
as in the derivation we included only the leading behavior which should be dominant at asymptotically small 
values of x. At any finite value of x the scaling will be mildly violated by the nonleading terms. We checked 
numerically that this is indeed the case, though the violation was very small. This analysis was extended for 
the case of the more realistic DGLAP evolution with the running coupling. As expected the presence of the 
scale violation due to the running coupling will lead to the violation of the scaling. In this case the geometric 
scaling is only approximate with the solution for the gluon density given by 



a s {Q 2 ) xg(x,Q 2 
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with b being the beta function of the QCD running coupling. The scaling here is present provided we have 
a s (Q s (x)) ln[Q 2 /Q 2 (x)]/ (2-Kb) <C 1. Thus the geometric scaling violating term can be factored out. 

In summary, this analysis shows that the geometric scaling property can be build into the DGLAP initial 
conditions, and that the solution to the linear evolution equation which do not include the parton saturation 
effects can preserve the scaling even in the regime of high Q 2 values, outside the saturation region. 



2. 3. 5 Geometric scaling from DGLAP evolution' 

From the DGLAP point of view there is another possible explanation for geometric scaling: the scaling be- 
haviour can be generated by the evolution itself, rather than being a preserved boundary condition. In fact, 
it is possible to show [141] both analytically and numerically that in the relevant HERA region approxi- 
mate geometric scaling is a feature of the DGLAP evolution. In order to see this, one has first to rewrite 
the DGLAP solution as a function of t — X(t, x) log l/x ("fixed-coupling scaling") or t — A(i, x)^/\og\/x 
("running-coupling scaling") 31 . Then from the explicit form of the DGLAP solution it follows that in the rel- 
evant kinematic region X(t, x) is approximatively constant, leading to aDGLAp{t, x) « a dglap (f — t s (x)). 
Hence approximate geometric scaling in the HERA region is a feature of the DGLAP evolution. Interestingly 
enough, this DGLAP-generated geometric scaling is expected to hold also at large Q 2 and relatively large x 
(say x < 0.1), in contrast with the saturation-based geometric scaling which should be a small x, small (or at 
least moderate) Q 2 effect. 

In order to make more quantitative statements, one can use the quality factor method introduced in Sec. 
12.3.21 As a stalling point, one can consider the leading-order small x DGLAP evolution of a flat boundary 

30 Contributing author: F. Caola 

The labels "fixed-coupling" or "running-coupling" are here a bit misleading. In fact, all the results shown here are obtained with 
the full running-coupling DGLAP solution. We kept this notation only for comparison with saturation-based approaches. 




Fig. 18: Scaling plot with x < 0.1. For the theoretical DGLAP curve, only points with Q 2 > 1 GeV 2 were kept. Curves are offset 
for clarity. 



condition. At the level of accuracy of geometric scaling, this approximation should be accurate enough in a 
wide kinematic region, say Q 2 J> 10 GeV 2 , x < 0.1 at HERA. Now, a quality-factor analysis shows that in 
this region the leading-order small x DGLAP solution has an excellent scaling behaviour, even better than the 
scaling behaviour observed in HERA data. Also the DGLAP predictions for the geometric slope A perfectly 
agree with the phenomenological values: from the DGLAP solution we obtain ^^. LAP = 0.32 ± 0.05 
("fixed- coupling" scaling) and \%g LAP = 1.66 ± 0.34 (" running-coupling" scaling), to be compared with 
^flx = 0-32 ± 0.06, Xrun = 1-62 ± 0.25. Moreover, data exhibit geometric scaling also for larger x, larger 
Q 2 (say x < 0.1 at HERA), as predicted by the DGLAP evolution. All these results are summarized in 
Fig. [jjO where we plot the theoretical and phenomenological 32 reduced cross sections in function of the 
"fixed-coupling" scaling variable lnr = t — Xlnl/x, with A = 0.32, in the HERA region with the cut 
x < 0.1. An analogous plot can be obtained for the "running-coupling" scaling [141]. We interpret these 
results as striking evidence that for Q 2 > 10 GeV 2 the geometric scaling seen at HERA is generated by 
the DGLAP evolution itself, without need of a peculiar saturation ansatz or of a suitable scaling boundary 
condition. 

For Q 2 < 10 GeV 2 the leading-order DGLAP solution exhibits violations of geometric scaling at 
small x. However, in this region any fixed-order DGLAP calculation fails because it does not resum small x 
logarithms. If one consider the DGLAP evolution at the resummed level, geometric scaling reappears quite 
naturally, both in the "fixed-coupling" and "running-coupling" forms [141]. Hence, small x resummation 
extends the region where geometric scaling is expected to values of Q 2 lower than 10 GeV 2 . However at low 
Q 2 sizeable higher twist and non perturbative effects can spoil the universal behaviour of the DGLAP solution. 
In this region hence the HERA scaling could still be generated by some DGLAP evolution, but, differently 
from the Q 2 > 10 GeV 2 region, here there is no strong evidence that this is in fact the case. 



2.3.6 Saturation model and higher twists 

The QCD description of hard scattering processes within the Operator Product Expansion (OPE) approach 
leads to the twist expansion of matrix elements of process-dependent composite operators. Contributions of 
emerging local operators with the increasing twists, r, are suppressed by increasing inverse powers of the hard 
scale, Q 2 . In DIS, at the lowest order (i.e. when the anomalous dimensions vanish), the twist-r contribution 
to the DIS cross section scales as Q~ T . Therefore, at sufficiently large Q 2 it is justified to neglect higher 
twist effects, and retain only the leading twist-2 contribution. This leads to the standard collinear factorisation 
approach with universal parton density functions evolving according to the DGLAP evolution equation. It 
should be kept in mind, however, that the higher twist effects do not vanish completely and that they introduce 
corrections to theoretical predictions based on the DGLAP approach. Thus, the higher twist corrections may 
affect the determination of parton density functions. The importance of these corrections depends on the 
level of precision required and on the kinematic domain. In particular, in the region of very small x the 

32 In fact, in order to make a more flexible analysis, we didn't use the actual HERA data but a neural network interpolation of world 
DIS data [142], As long as one stays in the HERA region the output of the net is totally reliable. 
33 Contributing author: L. Motyka 



higher twist effects are expected to be enhanced, so that they may become significant at moderate Q 2 . Thus, 
it should be useful to obtain reliable estimates of higher twist effects at small x. In this section we shall 
present higher twist corrections to Ft, Fl and F2 structure functions following from the DGLAP improved 
saturation model [143]. The results presented in this section have been obtained in the course of an ongoing 
study [144, 145]. The method applied to perform the twist decomposition of the DGLAP improved saturation 
model is a generalisation of the Mellin space approach proposed in Ref. [146]. 

A rigorous QCD analysis of the higher twist contributions to DIS at high energies is a complex task. 
So far it has been performed for the qqgg operators [147], but the evolution of twist 4 purely gluonic opera- 
tors has not been resolved, — even the proper complete basis of the operators has not been found yet. The 
collinear evolution is known at all twists, however, for so called quasi-partonic operators, for which the twist 
index is equal to the number of partons in the i-channel [148]. Such operators should receive the strongest 
enhancement from the QCD evolution. At the leading logarithmic approximation the collinear evolution of 
quasi-partonic operators is relatively simple — it is given by pair-wise interactions between the partons in the 
t-channel. The interactions are described by the non-forward DGLAP kernel [148]. Within this formalism, the 
evolution of four-gluon quasi-partonic operators was investigated in Ref. [149, 150] in the double logarithmic 
approximation. At small x the scattering amplitudes are driven by exchange of gluons in the t-channel, and 
the quark exchanges are suppressed by powers of x. Thus we shall focus on the dominant contribution of the 
multi-gluon exchanges in the t-channel. In the large iV c -limit, the dominant singularities of the four gluon 
operator are those corresponding to states in which gluons get paired into colour singlet states. In other words, 
the four-gluon operator evolves like a product of two independent gluon densities. In general, for 1/N C — ► 0, 
the 2n-gluon (twist-2n) operator factorizes into the product of n twist-2 gluon densities. After suitable inclu- 
sion of the AGK cutting rules and the symmetry factors of 1/n!, one arrives at the eikonal picture of n-ladder 
exchange between the probe and the target. This is to be contrasted with the Balitsky-Kovchegov picture of 
Pomeron fan diagrams, which was obtained as a result of resummation of the terms enhanced by powers of 
large ln(l/x) rather than by powers of In Q 2 . 

The eikonal form of the multiple scattering was assumed in the saturation model proposed by Golec- 
Biernat and Wiisthoff (GBW) [151, 152]. The dipole cross-section given by Eq.|25]has a natural interpretation 
in terms of a resummation of multiple scattering amplitudes. The scatters are assumed to be independent 
of each other, and the contribution of n scatterings is proportional to [r 2 /R^x)]" 1 . The connection of the 
saturation model to the QCD evolution of quasi-partonic operators is further strengthened by the DGLAP 
improvement of the dipole cross section [143]. In the DGLAP improved saturation model the dipole cross 
section depends on the collinear gluon density, 



where the scale p 2 depends on the dipole size, p 2 = C/r 2 for C/r 2 > p^, and p 2 = p^ for C/r 2 < p^. 
The gluon density applied has been obtained from the LO DGLAP evolution without quarks, with the input 
assumed at the scale p^ 34 . Clearly, in Eq. (l30l) one sees an exact matching between the power of r 2 and the 
power of xg(x, p 2 ) suggesting a correspondence between the term ~ [r 2 a s (p 2 ) xg(x, p 2 )] n in the expansion 
of &(x, r) and the twist-2n contribution to the dipole cross section. Thus, we expect that the saturation model 
approximately represents higher twist contributions in the deep inelastic scattering generated by the gluonic 
quasi-partonic operators. 

The twist analysis of the DIS cross-section must include a treatment of the quark box that mediates the 
coupling of the virtual photon, 7*, to the t-channel gluons. In the dipole model the 7*5 — > qq amplitude, 
computed within QCD, is Fourier transformed (w.r.t. the transverse momentum of the quark) to the coordinate 
representation and appears as the photon wave function, compare Eq. (|25T ). In more detail, one uses the 7*5 
amplitude computed within the /c-r-factorisation framework. This amplitude receives contributions from all 
twists. The twist structure of the quark box is transparent in the space of Mellin moments, and the same is true 




(30) 



34 In the original DGLAP-improved model [143] a different definition of the scale was adopted, [i 1 = C/r 2 + fj,Q, but this choice 
is less convenient for the QCD analysis. 



for the dipole cross-section. Thus we define, 
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It then follows from the Parsival formula that, 

<?t,l(x>Q 2 )= I ^^t,l(-7,Q 2 )<^,7)- (33) 
Jc 2m 

For the massless quark case one has Ht,l{i, Q 2 ) = Ht,l(i) Q~ 2j - The contour of integration, C, in Eq. [331 
belongs to the fundamental Mellin strip, — 1 < Re 7 < 0. 

In order to obtain the twist expansion of a, one extends the contour C in the complex 7-plane into a 
contour C closed from the left-hand side. The Mellin integral in Eq. [33] may be then decomposed into con- 
tributions coming from singularities of Ht,l(—^, Q 2 ) &{x, 7). The function Ht(—^) (Hl{— 7)) has simple 
poles at all negative integer values of 7, except of 7 = —2 (7 = —1), where Ht (Hl) is regular. The singular- 
ity structure of the dipole cross section, a {7), depends on the specific form of a(x, r 2 ). For a(x, r 2 ) used in 
the GBW model, the a(x, 7) has simple poles at all negative integers 7's. For the DGLAP improved form of 
a given by (I3TT) . a(x, 7) has cut singularities that extend to the left from 7 = k where k = —1, —2, etc. The 
leading behaviour of a around a branch point at 7 = k is given by ~ (7 — k) p ( k \ where the exponent p(k) 
is generated by the DGLAP evolution. As the cuts extend to the left from the branch points, the dominant 
contribution to the cross section at the given twist comes from the vicinity of the corresponding branch point. 

The singularity structure of the quark box part Ht,l{i) plays the crucial role in understanding the 
strength of the subleading twist effects. To see that one expands Ht.l(i) around the singular points, 7 = 1 
and 7 = 2 (recall that the argument of Ht,l is —7 in the Parsival formula (l33l): 
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for twist-2, and 
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H T (l) = ^ + bP +0(7-1), H L (<y) =bf + 0(7-1), (34) 



,( 4 ) 



H T (l) = bf + 0( 7 - 2), H L (l) = ^ + b { L ] + 0(7 " 2), (35) 

for twist-4. The singular 1/(7 — 1) and 1/(7 — 2) terms in (l34l) and (l35l) generate an additional enhancement, 
~ ln(Q 2 ), of the corresponding twist-2 and twist-4 contributions to the DIS cross-section. The constant 
pieces, proportional to b^ L and b^ L , produce no new logarithms (thus they are interpreted as the next- to- 
leading order (NLO) QCD corrections) and the higher terms in the Laurent expansion give yet higher orders 

in the perturbative expansion of the g — > q splitting functions and to the coefficient functions. We summarize 

(2) 

this discussion by displaying below the most leading contributions to &t,l at twist-2 (a T L ) and at twist-4 
^t i) obtained in the DGLAP improved saturation model: 



(2) „q2 , n , 2 , (2) 

4 2) ~ ~q2 J ^« S (Q' 2 )* 9 (*,Q' 2 ), 4 2) ~ -^a s {Q 2 )xg{x,Q 2 ), (36) 



for twist-2, and 



4 4) ~ b ^s(Q 2 )xg(x,Q 2 )] 2 , of ~ ^ j® 2 ^[a s (Q> 2 )xg(x,Q' 2 )] 2 , (37) 

for twist-4. These results imply that the the relative twist-4 correction to Ft is strongly suppressed w.r.t. the 
twist-2 contribution, as the subleading twist-4 term in Ft appears only at the NLO. On the contrary, for Fl, 



the leading twist term enters only at the NLO, and the the twist-4 correction enters at the leading order. So, the 
relative twist-4 effects in Ft are expected to be enhanced. Note, that both in the case of Ft and Fl the twist-4 
effects are enhanced w.r.t. the twist-2 contribution by an additional power of the gluon density, xg(x,Q 2 ). 
For the structure function F2 = Ft + Ft we expect small relative corrections from the higher twists because 



of the opposite sign of coefficients and b^' , that leads to cancellations between the twist-4 contributions 
from Ft and Ft at moderate Q 2 . These conclusions about the importance of the higher twist corrections are 
expected to be quite general, because they follow directly from the twist structure of the quark box and do not 
depend on the detailed form of the twist-4 gluon distribution. 

We performed [144, 145] an explicit numerical evalua- 
tion of the twist-4 corrections to Ft, Ft and F2 in the DGLAP 
improved saturation model, and compared the results to results 
obtained [146] within the GBW model without the DGLAP evo- 
lution. The parameters of the DGLAP model were fitted to de- 
scribe all F2 data at small x. In the model we took into account 
three massless quark flavours and the massive charm quark. The 
twist analysis, however, has been, so far, performed only for 
the massless quark contribution. The obtained relative twist-4 
corrections to Ft, Ft and F2 are displayed in Fig. 12.3.61 as a 
function of Q 2 , for x = 3 • 1CT 4 . The continuous curves corre- 
spond to the GBW model [146], and the dashed ones have been 
obtained [144, 145] in the DGLAP improved saturation model. 
Although there are some quantitative differences between the 
models, the qualitative picture is quite consistent and confirms 
the results of the analytic analysis outlined above. Thus, the 
higher twist corrections are strongest in Ft, and much weaker 

in Ft- In F2 there occurs a rather fine cancellation between the 
Fig. 19: The ratio of twist-4 to twist-2 components of . . . . . ^ , ,-, . „o , . „ 

twist-4 contributions to Ft and Fl, at all Q , down to 1 GeV . 
F T , F l and F 2 at x = 3 ■ 1CT 4 in the GBW model . , , , _ _ , . , . , , ■ ■„ 

Although an effect of this kind was expected, it still remains 
(continuous lines) and in the DGLAP improved satu- . . . . . „ . , „ „ r 

somewhat surprising that this cancellation works so well. We 
ration model (dashed lines). . . r _ _ „_a . . , . 

estimate that, for x = 3 ■ 10 , the twist-4 relative correction 

to F 2 is 2-4% at Q 2 = 10 GeV 2 , and smaller than 10% for all Q 2 down to 1 GeV 2 . For F L , the relative 
correction is ~ 20% at Q 2 = 10 GeV 2 , and strongly increases with the decreasing scale, reaching ~ 50% at 
Q 2 = 1 GeV 2 . It implies that the determination of partem densities from twist-2 F2 data is safe even at small x 
and moderate Q 2 . On the other hand Fl at small x may provide a sensitive probe of higher twist effects and 
partem saturation. 



Twist ratios: tw-4/tw-2 




2.3.7 Conclusions 

There are many possible explanations for the scaling properties of HERA data, some of them based on sat- 
uration effects and some others based on pure linear evolution. In order to separate between these different 
explanations, it is fundamental to specify a kinematic window. 

In particular, for large enough Q 2 and not too small x (say Q 2 Si 10 GeV 2 in the HERA region) the 
observed geometric scaling is determined by the DGLAP evolution, irrespective of the boundary condition. 
For smaller values of Q 2 , the evolution of partem densities is still linear, but is sensitive to a boundary condition. 
In an evolution toward smaller x, like BFKL, this boundary condition is dynamically generated by saturation, 
and it leads to the geometric scaling window. It is possible to take these effects into account also in a Q 2 
evolution, like DGLAP, by imposing as initial condition the same boundary condition. We have seen that, 
in this case, even the LO DGLAP equation is able to propagate geometric scaling towards larger Q 2 . In 
that domain, although geometric scaling may arise as saturation effect, the evolution is still linear, and thus 
compatible with standard PDFs analysis. However, at yet lower Q 2 and x standard linear evolution is no 
longer reliable. In particular, for Q 2 smaller than a x dependent saturation scale Q s (x), the evolution of 
parton densities becomes fully nonlinear, and this spoils the actual determination of the PDFs. Results from 



inclusive diffraction and vector meson exclusive production at HERA, and from dA collisions at RHIC all 
suggest that in the kinematic accessible x region Q s ~ 1 — 2 GeV. 

In conclusion, we can say that for large enough Q 2 10 GeV 2 geometric scaling is fully compatible 
with linear DGLAP evolution. For smaller Q 2 the situation becomes more involved. For Q 2 > 5 GeV 2 
the HERA scaling is still compatible with DGLAP, maybe with some small x resummation or some suitable 
boundary condition. However, other effects may be relevant in this region. For yet lower Q 2 and x the linear 
theory becomes unreliable and saturation could be the right explanation for geometric scaling. Unfortunately 
at HERA we have too few data for a definitive explanation of geometric scaling in the very small x region, 
since many different approaches lead approximatively to the same results and it is very difficult to separate 
among them. For example, in the low x region both saturation and perturbative resummations lead to a 
decrease of the gluon and to geometric scaling. At the LHC, where higher center-of-mass energy is available, 
the x region is significantly extended down to very small values. Especially in the fragmentation region the 
typical values of x which can be probed can reach down to 10~ 6 for partons with transverse momenta of about 
few GeV. This fact combined with the very wide rapidity coverage of the main LHC detectors opens up a 
completely new window for the study of parton saturation, and its relations with geometric scaling and linear 
evolution will possibly be clarified. 



3 BENCHMARKING OF PARTON DISTRIBUTIONS AND THEIR UNCERTAINTIES 35 
3.1 Introduction 

The proper treatment of uncertainties associated to the fit of Partem Distribution Functions (PDF) has become 
a subject of great interest in the last few years. A simple way of understanding differences between available 
approaches to partem fits is to fix some hypothesis (say, experimental data, QCD parameters, input parame- 
terizations, error treatment), and check what is the effect of the remaining assumptions. Such studies were 
previously done in the framework of the first HERA-LHC workshop [1]. 

In the following we will discuss three benchmark fits. The first one is presented in Sect. 13.21 It is 
based on the H12000 partem fit [18], and it compares a new version of this fit, in which uncertainty bands are 
determined [153, 154] using a Monte Carlo method, to the reference fit, where uncertainty bands are obtained 
using the standard Hessian method. The main motivation of this benchmark is to study the impact of possible 
non-Gaussian behaviour of the data and, more generally, the dependence on the error treatment. 

The second benchmark is presented in Sect. 13.31 It is based on the study performed by S. Alekhin and 
R. Thorne in Ref. [1], which compared the fits by their respective groups to a common reduced set of data with 
common assumptions, and also to their respective reference (global) fits. This comparison is extended here in 
two ways. First, the comparison is extended to include an NNPDF fit to the same reduced set of data with the 
same assumptions, and the NNPDF1.0 reference fit [155]. Second, results are also compared to a fit based on 
the recent MSTW 2008 [39, 156] analysis. As in the Thorne benchmark fit, this uses slightly different data sets 
and assumptions; it is furthermore modified to use the same input parameterization and improved treatment 
of uncertainties as MSTW. The main purpose of these comparisons is to answer the questions (a) to which 
extent fit results from various groups obtained using different methodologies still differ from each other when 
common or similar assumptions and a common or similar reduced dataset are used and (b) how the fits to the 
reduced dataset by each group compare to the fit to the full dataset. 

The third benchmark, discussed in Sect. 13.41 is a further elaboration on the benchmark presented in 
Sect. 13.21 extended to include the NNPDF fit, which also uses a Monte Carlo approach. The main purpose 
of this benchmark is to compare two fits (HI and NNPDF) which have the same error treatment but different 
partem parameterizations. The inclusion in this benchmark of the NNPDF fit is also interesting because it 
allows a comparison of a fit based on a very consistent set of data coming from the HI collaboration only, to 
fits which include all DIS data sets, which are less compatible than the HI sets alone. 

3.1.1 Settings for the HI benchmark 

This analysis is based on all the DIS inclusive data by the HI collaboration from the HERA-I run. A kinematic 
cut of Q 2 > 3.5 GeV 2 is applied to avoid any higher twist effect. The data points used in the analysis are 
summarized in Table Q] and Fig. [20] 

The theoretical assumptions are: 

• NLO perturbative QCD in the MS renormalization and factorization scheme; 

• zero-mass variable flavour number scheme with quark masses m c = 1.4 GeV and = 4.5 GeV; 

• the strong coupling fixed to a s (Mz) = 0.1185; 

• momentum and valence sum rules enforced; 

• starting scale for the evolution at Ql = 4 GeV 2 ; 

• strange contribution fixed as 

s(x, Ql) = s(x, Ql) = f s D(x, Ql) = Y^yd(x, Q 2 ), (38) 

with U = u + c and D = d + s + b and with f s = 0.33; 

• charm contribution fixed as 

c(x, Ql) = c(x, Ql) = f c U{x, Ql) = Qo), (39) 

35 Contributing authors: R. D. Ball, L. Del Debbio, J. Feltesse, S. Forte, A. Glazov, A. Guffanti, J. I. Latorre, A. Piccione, V. Rade- 
scu, J. Rojo, R. S. Thorne, M. Ubiali, G. Watt 



Data Set 


Data points 


Observable 


Ref. 


H197mb 


35 


a Na >+ 


[89] 


H1971owQ2 


80 


a NC >+ 


[89] 


H197NC 


130 


B NC,+ 


[157] 


H197CC 


25 


a cc >+ 


[157] 


H199NC 


126 


a NC >~ 


[88] 


H199CC 


28 




[88] 


H199NChy 


13 


a NC >~ 


[88] 


H100NC 


147 


a NC '+ 


[18] 


HIOOCC 


28 


a cc >+ 


[18] 


Total 


612 





Table 1: Data points used in the HI benchmark after kinematic cuts of Q 2 > 3.5 GeV 2 . 




Fig. 20: The data used in the HI benchmark and in the NNPDF reference fit. 

with/ c = 0.15; 

• five independent PDFs: gluon and U, D, U, D (see definition above); 

• iterated solution for evolution (see, e.g. [158], Sect. 1.3). 
Both the HI and NNPDF methodologies are based on 

• Monte Carlo method to determine uncertainties. This method will be discussed in detail in Sect. 13.2.21 
below. 

They differ in the way PDFs are parameterized: 

• HI parameterizes PDFs as 

xg{x,Ql) = AgX B «(l-x) c '[l + D g x], 
xU{x,Ql) = Aux^il-xfvll+Dux + Fux 3 ], 
xD{x,Q%) = A d x Bd (1-x) Cd [1 + D d x], (40) 
xtJ{x,Ql) = A D x B v (1 - xfv , 
xD(x,Q%) = AijX B °(l-x) c v , 

(41) 

which yields 10 free parameters after sum rules are imposed; 

• NNPDF parameterizes PDFs with a 2-5-3-1 neural network, which implies 185 free parameters to be 
fitted. 

Because of the large number of parameters, the minimum of the NNPDF fit is determined using the stop- 
ping criterion discussed in Sect. 13.3.21 below, while the minimum of the HI fit is determined as the standard 
minimum x 2 (or maximum likelihood) point of parameter space. 



3.1.2 Settings for the HERA-LHC benchmark 

This benchmark was first presented in Ref. [1], where its settings were defined. In order to have a conservative 
ensemble of experimental data and observables, only structure function DIS data are used. Large kinematic 
cuts are applied to avoid any higher twist effect. The data points used in the Alekhin analysis are summarized 
in Table E| and Fig. E2 



Data Set 


Data points 


Observable 


Ref. 


ZEUS 97 


206 




[91] 


Hllowx97 


77 




[89] 


NMC 


95 




[116] 


NMC_pd 


73 




[159] 


BCDMS 


322 


Fl 


[160] 


Total 


773 





Table 2: Data points used in the HERA-LHC benchmark after kinematic cuts of Q 2 > 9 GeV 2 and W 2 > 15 GeV 2 are applied. 




Fig. 21: The data used in the HERA-LHC benchmark and in the NNPDF reference fit. 



The theoretical assumptions are: 

• NLO perturbative QCD in the MS renormalization and factorization scheme; 

• zero-mass variable flavour number scheme with quark masses m c = 1.5 GeV and m\, = 4.5 GeV; 

• a s (M z ) fitted: the best-fit values are 0.1110 ± 0.0012 (Alekhin) and 0.1132 ± 0.0015 (Thorne); 

• momentum and valence sum rules imposed; 

• starting scale for evolution Qq = 1 GeV 2 ; 

• four independent input PDFs (u and d valence, the sea and the gluon); 

• no light sea asymmetry: u = d; 

• no independent strange PDF: 

s(x, Ql) + s(x, Ql) = 0.5(u(x, Q 2 ) + d(x, Q 2 )) ; (42) 

• iterated solution of evolution equations; 

The NNPDF analysis presented here is based on the same data set and theoretical assumptions, the only 
difference being that the strong coupling is fixed to a s (Mz) = 0.112, i.e. the average of the fitted values of 
S. Alekhin and R. Thorne. 

The Thorne benchmark used somewhat different data sets and assumptions. Namely: 



Data Set 


Data points 


Observable 


Ref. 


ZEUS 97 


206 


a N(J -+ 


[91] 


Hllowx97 


86 


a NC '+ 


[89] 


NMC 


67 


Fl 


[116] 


NMC_pd 


73 




[159] 


BCDMS 


157 


Fl 


[160] 


Total 


589 





Table 3: Data points used in the MSTW benchmark fit after kinematic cuts of Q 2 > 9 GeV 2 and W 2 > 15 GeV 2 are applied. 

• A somewhat different dataset is used, as displayed in Table [3] This differs from the dataset of Table [2] 
and Figure |2] because the NMC and BCDMS fixed-target data on F% used are averaged over different 
beam energies, and also, HERA reduced cross sections rather than structure function data are used, 
resulting in an additional nine HI points. Note that the Thorne benchmark in Ref. [1] also included the 
F 2 d BCDMS deuterium data. 

• All correlations between systematics are neglected, and statistical and systematic errors are added in 
quadrature. 

• Normalizations of individual data sets are fitted with a rescaling of uncertainties to avoid systematic 
bias. 

• The F2/F2 data are corrected for nuclear shadowing effects [161]. 

The MSTW analysis presented here makes the same choices as the Thorne benchmark, but with a s (Mz) = 
0.112, and additionally 

• a global correction of —3.4% is applied to the luminosity of the published HI MB 97 data [89] following 
a luminosity reanalysis [162]. 

• a quartic penalty term in the \ 2 definition is given to normalizations which deviate from the central 
value. 

3.2 Experimental Error Propagation 36 

3. 2. 1 Introduction 

Standard error estimation of proton partem distribution functions (PDFs) relies on the assumption that all 
errors follow Gaussian (or normal) statistics. However, this assumption may not always be correct. Some sys- 
tematic uncertainties such as luminosity and detector acceptance follow rather a log-normal distribution (see 
Section |4~TI ). Compared to the Gaussian case, the lognormal distribution which has the same mean and root 
mean square (RMS), is asymmetric and has a shifted peak, as shown illustratively in Figure [22] Therefore, 
the non-Gaussian behaviour of the experimental uncertainties could lead to an additional uncertainty of the 
resulting PDFs. An alternative to the standard error propagation is a toy Monte Carlo (MC) method. Here, 
an implementation of the MC method is presented for estimation of the PDF uncertainties with various as- 
sumptions for the eiTor distribution. In addition, this MC method provides an independent cross check of the 
standard error propagation when assuming the Gaussian error distributions. 

3.2.2 Method 

The Monte Carlo technique consists firstly in preparing replicas of the initial data sets which have the central 
value of the cross sections, <7j, fluctuating within its systematic and statistical uncertainties taking into account 
all point to point correlations. Various assumptions can be considered for the error distributions. When dealing 
with the statistical and point to point uncorrected errors, one could allow each data point to randomly fluctuate 
within its uncorrected uncertainty assuming either Gauss, lognormal, or any other desired form of the error 
distribution. For example, for Gaussian errors 

o-i — tn (1 + 5™ r • Ri) , (43) 
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where 5V- ncorr corresponds to the uncorrelated uncertainties and Ri is a random number chosen from a normal 
distribution with a mean of and a standard deviation of 1. Hence, the central value of each cross section 
point i is shifted by Sf ncorr ■ R { . 

For the systematic errors, the treatment is a bit more complicated than above. This is due to the cor- 
relation between data points and that, in general, the data points are sensitive to the systematic sources with 
a different strength 6ij, where index i (j) runs over all the cross section points (all systematic sources). In 
order to take this into account, for each systematic source j a uniformly distributed fluctuation probability Pj 
is selected. Then, for each data point i the central value of cross section is shifted such that probability of this 
shift, which depends on <5y and the exact form of the probability distribution function, is equal Pj (for positive 
Sij) or (1 — Pj) (for negative <5y). In other words, each central value of the cross section is shifted with the 
same probability of the corresponding systematic shift. For example for the Gaussian errors, this procedure is 
equivalent to 

(N sys \ 
i + sr corr -Ri+Yl 5 T ■ R i h < 44) 

where in addition to the shifts for the uncorrelated errors previously explained, Rj corresponds to another 
random number chosen from a normal distribution with mean of and standard deviation of 1 as a fluctuation 
for the systematic source j. Hence, the central values of the cross sections are shifted in addition by 5fj Tr ■ Rj 
for each systematic shift. 

This preparation of the data is repeated for N times, where high statistics is desirable for more accurate 
results. For this study we used N > 100 which proved to suffice. For each replica, a next to leading order 
(NLO) QCD fit is performed to extract the PDFs. The errors on the PDFs are estimated from the RMS of the 



spread of the N lines corresponding to the N individual fits to extract PDF. 

A fit to the published HI HERA-I data of neutral and charged current e^p scattering cross sections [18] 
using the settings discussed in Sect. 13.1 .11 has been performed, using the QCDNUM program [163]. 

3.2.3 Validation of the Method 

The MC method is tested by comparing the standard error estimation of the PDF uncertainties with the MC 
techniques by assuming that all the errors (statistical and systematic) follow Gaussian (normal) distribution. 
Figure [23] shows good agreement between the methods. 



Fit vs H1 PDF2000, Q 2 = 4. GeV 2 




X 

Fig. 23: Comparison between the standard error calculations and the Gauss error distribution is shown for the gluon PDF. Green lines 
represent the spread of Monte Carlo generated allowances for the errors, and the red lines are the RMS of this spread. The black lines 
correspond to the standard error calculations of the PDF errors. 



3.2.4 Test of various assumptions for the error distributions 

Two cases are considered which may represent most likely the error distributions: (1) the lognormal distribu- 
tion for the luminosity uncertainty and the rest of the errors are set to follow the Gauss shape, (2) the lognormal 
distributions for all the systematic errors and the statistical errors are set to follow the Gauss distributions. The 
results for the first case (1) are shown in Figure [24] The results of the tests for the case when lognormal 
distributions for all the systematic uncertainties are assumed is shown in Figure [24] We observe that for the 
precise HI HERA-1 data the effect of using lognormal distribution, which is considered for some systematic 
uncertainties more physical, is similar to the pure gauss distribution case. 

3.2.5 Conclusions 

A simple method to estimate PDF uncertainties has been built within QCD Fit framework. Assuming only 
gauss distribution of all errors, the results agree well with the standard error estimation. This method allows to 
check the effect of non- gauss assumptions for distributions of the experimental uncertainties. For the HI data, 
results are similar to the gauss case when using lognormal. The method could be extended for other physical 
variables (i.e. cross sections) for cross checks with the standard error evaluation. 

3.3 HERA-LHC Benchmark 

This benchmark is based on the Alekhin/Thorne benchmark of Ref. [1], whose settings has been given in 
Sect. 13. 1.2] Both the Alekhin and Thorne fits had the following features: 
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Fig. 24: Comparison between errors on PDFs obtained via standard error calculation (black) where Gauss assumption is used, and 
errors obtained via Monte Carlo method (red) where luminosity uncertainty is allowed to fluctuate according to lognormal distributions 
and all the other uncertainties follow the Gaussian distribution (left), and where all the systematic uncertainties are allowed to fluctuate 
according to lognormal distributions (right). Only the gluon PDF is shown, where the errors are larger. The green lines show the spread 
of the N individual fits. 

• uncertainties determined using the Hessian method with Ax 2 = 1", 

• input PDFs are parameterized using the following functional form: 

xf i (x,Q 2 ) = A i (l-x) b *(l + e i x°- 5 + li x)x a *. (45) 

with 6j and ji set to zero for the sea and gluon distributions. Hence, there were a total of 13 free PDF 
parameters plus a s (Mz) after imposing sum rules. 

Here, we reanalyze it within the MSTW and NNPDF approaches. First, we summarize the respective 
MSTW and NNPDF approaches, and especially their differences when compared to the previous HERALHC 
benchmark fits of Ref. [1]. Then, results for benchmark fits obtained with the various different approaches are 
compared to each other. Finally, we compare each benchmark fit to its counterpart based on a wider range of 
data, i.e. the NNPDF1.0 [155] reference and the MRST01 [164] and MSTW08 [39, 156] PDFs. 

3. 3. 1 MSTW approach 3,1 

The benchmark analysis is now much more closely aligned to the global analysis than was the case for the 
Thorne benchmark compared to the MRST global analysis. It follows the general approach taken by the 
MRST (or more recently, MSTW) group, and is similar to that described in Ref. [164]. There are some new 
features which are explained below. 

- Input parameterization. We take the input PDF parameterization at Qq = 1 GeV 2 to be: 



xu v (x,Q 2 ) = A u x^(l-x)^(l + e u ^ + lu x), (46) 

xd v {x,Q 2 ) = A d x^(l-x)^{l + e d ^ + ld x), (47) 

xS(x,Q 2 ) = A s x 5s (l-x) r >s(l + esVx~ + 7sx), (48) 

xg{x,Ql) = A g x 5 °(l-x) r > ! >(l + e g Vx~ + 'y g x)+A g ,x s <>'(l-x) r >9' , (49) 
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where S = 2{u + d + s), s = s = 0.1 S and d = u. The parameters A u , Ad and A g are fixed by 
sum rules, leaving potentially 19 free parameters. In practice, to reduce the number of highly correlated 
parameters, making linear error propagation unreliable, we determine the central value of the benchmark 
fit by freeing all 19 parameters, then fix 6 of those at the best-fit values when calculating the Hessian 
matrix used to determine the PDF uncertainties, giving a total of 13 eigenvectors. This is the same 
procedure as used in the MSTW 2008 global fit [39, 156], where there are an additional 3 free parameters 
associated with d — u and an additional 4 free parameters associated with strangeness, giving a total of 
20 eigenvectors. Note that the parameterization used in the previous Alekhin/Thorne benchmark fits was 
considerably more restrictive, where the es, 7s, e g and j g parameters were set to zero, and the second 
(negative) gluon term was omitted entirely. In addition, e u was held fixed for the Thorne benchmark 
fit, leaving a total of 12 eigenvectors. We find that the more flexible gluon parameterization, allowing 
it to go negative at very small x, is very highly correlated with the value obtained for a s , and a value 
of a s (Mz) = 0.105 is obtained if it is allowed to go free at the same time as the other parameters, 
therefore we instead choose to fix it at a s (Mz) =0.112 as in the NNPDF benchmark fit. 

- Error propagation. Apart from the more flexible input parameterization, the other major difference in 
the new MSTW version of the HERA-LHC benchmark fit, with respect to the previous Thorne (MRST) 
version, is the choice of tolerance, T = y Ax 2 - The MRST benchmark fit used the standard choice 
T = 1 for one-sigma uncertainties. More precisely, the distance t along each normalized eigenvector 
direction was taken to be 1, and ideal quadratic behaviour about the minimum was assumed, giving 
T « t = 1. The MRST global fit used T = \/50 for a 90% confidence level (C.L.) uncertainty 
band; however, this is not appropriate when fitting a smaller number of data sets. Recently, a new 
procedure has been developed [39, 156] which enables a dynamic determination of the tolerance for 
each eigenvector direction, by demanding that each data set must be described within its one-sigma 
(or 90%) C.L. limits according to a hypothesis-testing criterion, after rescaling the \ 2 f° r eacn data set 
so that the value at the global minimum corresponds to the most probable value. Application of this 
procedure to the MSTW benchmark fit gives T ~ 3 for one-sigma uncertainties and T ~ 5 for 90% 
C.L. uncertainties. For the MSTW global fit, the typical values of T required are slightly larger, with 
more variation between different eigenvector directions. The increase in T in the global fit is mainly due 
to the inclusion of some less compatible data sets, while the greater variation in T between eigenvectors 
is due to the fact that some parameters, particularly those associated with s and s, are constrained by 
far fewer data sets than others. In the MSTW fits, the data set normalizations are allowed to vary, 
with the aforementioned penalty term, when determining the PDF uncertainties. For global fits this 
automatically leads to a small increase in uncertainty compared to the MRST determinations, where data 
set normalisations were held fixed when calculating the Hessian matrix used for error propagation. In 
the MRST benchmark fit the data set normalizations were allowed to vary. To calculate the uncertainty 
bands from the eigenvector PDF sets, we use the formula for asymmetric errors given, for example, in 
Eq. (13) of Ref. [164]. 

3. 3. 2 NNPDF approach 3 * 

The NNPDF approach was proposed in Ref. [165], and it was applied there and in Ref. [142] to the param- 
eterization of the structure function ^(x, Q 2 ) with only two or more experimental data sets respectively. In 
Ref. [166] it was first used for the determination of a single PDF (the isotriplet quark distribution), and in 
Ref. [155] a full set of PDFs fit based on DIS data (NNPDF1.0) was presented. Because the method has been 
discussed extensively in these references, here we only summarize briefly its main features. 

- Error propagation. We make a Monte Carlo sample of the probability distribution of the experimental 
data by generating an ensemble of N replicas of artificial data following a multi-gaussian distribution 
centered on each data point with full inclusion of the experimental covariance matrix. Each replica is 
used to construct a set of PDFs, thereby propagating the statistical properties of the data Monte Carlo 
sample to a final Monte Carlo sample of PDFs. Here we shall take N = 100. The method is the same 
as discussed in Sect. I3.2.2L the only difference being the treatment of normalization errors: relative 
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normalizations are fitted in the HI approach, while they are included among the systematic errors in the 
Monte Carlo data generation in the NNPDF approach (see Refs. [18, 155] for details of the respective 
procedures) . 

- Input parameterization. Each PDF is parameterized with a functional form provided by a neural net- 
work. The architecture for the neural network is the same for all PDFs, and yields a parameterization 
with 37 free parameters for each PDF This is a very redundant parameterization, it is chosen in or- 
der to avoid parameterization bias; neural networks are a particularly convenient way of dealing with 
redundant parameterizations. Note that sum rules are also imposed. 

- Minimization. A redundant parameterization allows for fitting not only the underlying physical be- 
haviour, but also statistical noise. Therefore, the minimization is stopped not at the absolute minimum 
of the x 2 , but rather before one starts fitting noise. This optimal stopping point is determined as follows: 
the data in each replica are randomly assigned either to a training or to a validation set. The fit is per- 
formed on data of the training set only, while the validation set is used as a monitor. The fit is stopped 
when the quality of the fit to the training set keeps improving, but the quality of the fit to the validation 
set deteriorates. 

3.3.3 Comparison between the Benchmark Parton Distributions 



Data Set 


Xbench/^data 


Xglobal/^data 


ZEUS 97 


1.09 


1.18 


Hllowx97 


1.03 


1.00 


NMC 


1.40 


1.45 


NMC_pd 


1.24 


1.32 


BCDMS 


1.21 


1.98 


Total 


1.19 


1.53 



Table 4: NNPDF \ 2 f° r the total and each single data set, both for the benchmark and global fit. 



Data set 


diag ' l 1 at 
Xbench /- /v data 


diag 2 / a r 
^global / iv data 


ZEUS 97 


0.76 


0.79 


Hllowx97 


0.53 


0.54 


NMC 


1.08 


1.11 


NMC_pd 


0.78 


0.89 


BCDMS 


0.74 


1.13 


Total 


0.76 


0.89 



Table 5: MSTW \ 2 for the total and each single data set, both for the benchmark and global fit. Notice that statistical and systematic 
errors are added in quadrature and that relative data set normalizations are fitted. 

The x 2 P er data point for the NNPDF and MSTW fits are shown in Table |4] and [5] respectively. Note 
that in the MSTW fit statistical and systematic errors are added in quadrature, so the quantity shown is the 
diagonal contribution to the \ 2 - The quality of the NNPDF is seen to be uniformly good. The quality of the 
MSTW is also uniform, though it cannot be compared directly because of the different way systematics are 
treated. The comparison of each benchmark fit to the corresponding global fit will be discussed in Sect. 13.3.41 
below. 

In Fig.|25]the PDFs from the NNPDF and MSTW benchmark fits presented here are compared to those 
by Thorne from Ref. [1] at the same reference scale of Q 2 = 20 GeV used there (denoted as MRST01 in 
the figure). The benchmark fit by Alekhin [1] is not shown as the PDFs are very close to the those by Thorne 
displayed in Fig. [25] 




For PDFs and kinematical regions where data are available, namely the small- x gluon and sea quark 
and the large-x u v distributions, the central values of the NNPDF fit are quite close to those of the MRST 
and MSTW fits, despite the differences in methodology. The central values of the PDFs are slightly different 
for the MRST and MSTW benchmark fits due to the use of BCDMS F 2 d data in the former, which affects 
mainly valence quarks. Where extrapolation is needed, such as for the d v distribution, which is constrained 
only by the small amount of data on the ratio F^jF^, or the large-x sea quark, central values are rather more 
different (though the Alekhin/MRST/MSTW benchmark central values are within the NNPDF error band). 
The exception is the smallest- x gluon, where the form of the MSTW parameterization results in a very sharp 
turn-over. However, even here the uncertainty bands are close to overlapping. 

Differences are sizeable in the estimation of uncertainties. Firstly, uncertainty bands for NNPDF bench- 
mark are significantly larger than for the MSTW benchmark, which in turn are in general somewhat larger 
than those for the MRST benchmark. The difference between MRST and MSTW, which are based on similar 
methodology, is due to use of a dynamic tolerance and a more flexible gluon parameterization in MSTW (see 
Sect. 13-3- 1 b - Secondly, the width of the uncertainty band for NNPDF benchmark varies rather more than that 
of the MRST benchmark according to the PDF and the kinematic region, though this is not quite so much the 
case comparing to MSTW benchmark. Indeed, the NNPDF uncertainties are quite small in the region between 
x = 0.01 and x = 0.1 (where there is the bulk of HERA and fixed-target data), while they blow up in the 
large-x region for the sea quark or the small-x gluon, where there is less or no experimental information. The 
smallness of the uncertainty band for MSTW for the small-x valence quarks may be partially due to the lack 
of flexibility in the parameterization: note that because of sum rules, the size of uncertainties in the data and 



extrapolation region are correlated. 

Finally, the MRST7MSTW central value generally falls within the NNPDF uncertainty band, but the 
NNPDF central value tends to fall outside the MRST/MSTW uncertainty band whenever the central values 
differ significantly. 

3.3.4 Comparison of the Benchmark Parton Distributions and Global Fits 
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Fig. 26: Comparison of the NNPDF benchmark and reference fits for the gluon, d-sea, tt-valence and d- valence at Q 2 — 20 GeV 2 . 



In Fig. [26]we compare the NNPDF benchmark fit to the NNPDF1.0 reference fit of Ref. [155] (NNPDF 
global, henceforth), while in Fig. |27] we compare the MSTW benchmark fit to the MRST01 [164] (MRST 
global, henceforth) and MSTW08 [39, 156] global fits (MSTW global, henceforth). 

The x 2 of the NNPDF benchmark and global fits are compared in Table 01 while those of the MSTW 
benchmark and global fits are compared in Table [5] Note that for the NNPDF fits the x 2 is computed us- 
ing the full covariance matrix, while for the MSTW fits systematic and statistical uncertainties are added in 
quadrature. Note also that the MRST and MSTW global fits are carried out in a general-mass variable flavour 
number scheme rather than the zero-mass variable flavour number scheme used in the corresponding bench- 
mark fits, whereas for NNPDF both global and benchmark fits are done with a zero-mass variable flavour 
number scheme. Comparison of the quality of each benchmark to the corresponding global fit to the same 
points in Table [5] shows a significant deterioration in the quality of the fit (total Ax 2 ^> 1), especially for the 
BCDMS i<2 data. All fits appear to be acceptable for all data sets: for instance, even though the x 2 of the 
NNPDF global fit for the benchmark subset of data is 1.98, it is equal to 1.59 [155] for the full BCDMS set of 
data. However, the increase in x 2 suggests that there might be data inconsistencies. 




Let us now compare each pair of benchmark and global fits. For NNPDF, the difference in central 
value between benchmark and reference is comparable to that found between the MRST or Alekhin global 
fits and their benchmark counterparts in Ref. [1]. However, the NNPDF global and benchmark fits remain 
compatible within their respective error bands. Indeed, the NNPDF benchmark fit has a rather larger error 
band than the reference, as one would expect from a fit based on a rather smaller set of (compatible) data. 
Such a behaviour was however not observed in the comparison between global and benchmark MRST and 
Alekhin fits of Ref. [1]. 

It is interesting to observe that the gluon shape at low x of the benchmark and global NNPDF disagree 
at the one a level (though they agree at two a). This can be understood as a consequence of the fact that the 
value of a s in the two fits is sizably different (a s = 0.112 vs. a s = 0.119). Theoretical uncertainties related 
to the value of a s were shown in Ref. [155] to be negligible and thus not included in the NNPDF error band, 
but of course they become relevant if a s is varied by several standard deviations (3.5 a, in this case). 

Coming now to MSTW, we first notice that, as discussed in Sect. 13.3.31 the MSTW benchmark set has 
somewhat larger uncertainty bands than the MRST benchmark set and thus also than each of the sets obtained 
from global fits. Consequently, the MSTW benchmark PDFs are generally far more consistent with the MSTW 
global fit sets than the corresponding comparison between MRST benchmark PDFs and global fit PDFs shown 
in Ref. [1], largely due to the more realistic uncertainties in the MSTW benchmark. Comparing central values 
we see exactly the same feature in the gluon distribution as the NNPDF group, and the explanation is likewise 
the same, highlighting possible difficulties in comparing PDFs obtained with different values of a s (Mz)- 
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Fig. 28: Comparison of the NNPDF benchmark and reference fits for the gluon, d-sea, u v andd„ at Q 2 =4GeV 2 . 



Unlike for the NNPDF group, the MSTW group sees some degree of incompatibility between the bench- 
mark PDFs and the global fit PDFs for the valence quarks, particularly in the case of the down valence. This 
may be related to the assumption u = d, which constrains valence quarks and sea quarks in an artificial manner 
since there is less flexibility to alter each independently. Indeed, in the global fits there is an excess of d over 
u which maximizes at x = 0.1. Forcing equivalence of antiquark distributions might therefore lead to a deficit 
of down sea quarks and a corresponding excess of up sea quarks, and also, for the same reason, to an excess of 
down valence quarks. These are indeed seen both in the NNPDF and MSTW benchmark fits when compared 
to the respective global fits. The effect is however well within the uncertainty bands for NNPDF, which indeed 
do not observe any statistically significant difference between results of a fit to the reduced benchmark data 
set with the u = J assumption (as presented in Fig. [26]) or without it (as presented in Ref. [155], Fig. 12). 

As well as this important effect one sees that the main discrepancy at x = 0.1 for down valence quarks 
is greater when comparing the benchmark fits to the global MSTW fit than to the global MRST fit. This is 
because recent new Tevatron data on Z rapidity distributions and lepton asymmetry from W decays provide 
a strong constraint on the down quark, and some of this new data shows considerable tension with other data 
sets. 

3.4 HI Benchmark 

We now discuss the extension of the fit using the settings of Sect. l3.1.T1 to also include the NNPDF approach. 
Results are compared both to those of the NNPDF reference fit, and to those obtained by the HI fit of Sect. 13.21 
to the same data. We then compare the NNPDF benchmark and reference, with the specific aim of addressing 



the issue of the dependence of the results on the size of the data set (HI dataset vs. the HERA-LHC dataset 
of Sect. 13- 3b - Finally, the HI and NNPDF benchmark fits are compared to each other with the purpose of 
understanding the impact of the respective methodologies. 




Fig. 29: Left: NNPDF benchmark and reference fits at t/s = 301GeV compared to HI charged current data. Center: NNPDF 
reference fit compared to HI and ZEUS neutral current data. Right: NNPDF benchmark fit compared to HI neutral current data. 



The results of the NNPDF benchmark are compared to the NNPDF reference fit results in Fig. [28] 
The general features of the benchmark are analogous to those of the HERA-LHC benchmark discussed in 
Section 13.3.41 with some effects being more pronounced because the benchmark dataset is now even smaller. 
Specifically, we observe that uncertainties bands blow up when data are removed: this is very clear for instance 
in the d distribution at large-x, as a consequence of the fact that the benchmark dataset of Table Q] does not 
include deuterium data. The negative value of this PDF at large x is presumably unphysical and it would 
disappear if positivity of charged current cross sections were imposed, including also the (anti-)neutrino ones. 
The only positivity constraint in the NNPDF fit is imposed on the Fi structure function [155], because this is 
the only DIS observable whose positivity is not constrained by the full data set. 

It is interesting to note however that this effect is not observed for the u v distribution, where instead 
the benchmark and the reference fit show almost equal uncertainties. In order to understand this, in Fig. [29] 
we compare two situations with or without error shrinking, by examining the predictions obtained using the 
benchmark and reference fits for some observables to the corresponding data. A first plot (left) shows the 
shrinking of the uncertainty on the prediction for the charged-current cross section in the reference fit. This 
is mostly due to the CHORUS neutrino data, which are in the reference and not in the benchmark. These data 
are clearly consistent with the HI data shown in the plot. The subsequent pair of plots compares (center) the 
prediction for the neutral-current cross section from the reference fit compared to HI and ZEUS data (both of 
which are used for the reference fit), and (right) from the benchmark fit to the HI data only (which are the only 
ones used in the benchmark fit). The uncertainty bands in the two fits are similar size: indeed, the ZEUS and 
HI data display a systematic disagreement which is approximately the size of this uncertainty band. Hence, 
the (small but significant) systematic inconsistency between the ZEUS and HI data prevents reduction of the 
uncertainty band when the ZEUS data are added to the fit, beyond the size of this discrepancy. Therefore, the 
NNPDF methodology leads to combined uncertainties for inconsistent data which are similar to those obtained 
with the so-called PDG (or scale-factor) method [167]. 

Notice that if relative normalization are fitted (as done by in the HI approach of Sect. 13.21 ) instead of 
being treated simply as a source of systematics, this systematic inconsistency would be significantly reduced 
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Fig. 30: Comparison of the NNPDF and HI benchmark fit for the gluon, d-sea, u v andd„ at Q 2 =4GeV 2 . 



Data Set 


Xm/^data 


XNNPDF/^data 


H197mb 


0.83 


0.82 


H1971owQ2 


0.90 


0.87 


H197NC 


0.69 


0.80 


H197CC 


0.73 


0.97 


H199NC 


0.88 


1.01 


H199CC 


0.62 


0.84 


H199NChy 


0.35 


0.35 


H100NC 


0.97 


1.00 


H100CC 


1.07 


1.38 


Total 


0.88 


0.96 



Table 6: HI and NNPDF \ for the total and each single data set. Cross correlations among data sets are neglected to evaluate the \ 2 
of a single data set. 

in the best-fit. The associate uncertainty however then appears as an addition source of systematics. This 
happens when HI and ZEUS data are combined in a single dataset (see Section [4j] below). In the NNPDF 
approach, instead, this systematics is produced by the Monte Carlo procedure. 



Fit vs H1 PDF2000, Q 2 = 4. GeV 2 




Fig. 31: The Monte Carlo set of gluon PDFs for the HI benchmark (left, same as Fig. [23} and the NNPDF benchmark. The red lines 
show the one-sigma contour calculated from the Monte Carlo set, and in the HI case the black lines show the Hessian one-sigma 
contour. 



3.4.2 Comparison between the Benchmark Parton Distributions 

The x 2 of th e HI and NNPDF benchmarks are given in Tabled while the corresponding PDFs are compared 
in Fig. [30] Furthermore, in Fig. [3]] we show the respective full Monte Carlo PDF sets in the case of the gluon 
distribution. 

The quality of the two fits is comparable, the differences in \ 2 being compatible with statistical fluctua- 
tions. In the region where experimental information is mostly concentrated, specifically for the u v distribution 
over all the x-range and for the d and the d v distributions in the small- x range, the results of the two tits are in 
good agreement, though the HI uncertainty bands are generally somewhat smaller. 

In the region where experimental information is scarce or missing, sizable differences are found, similar 
to those observed when comparing the MRST/MSTW bench and NNPDF bench to the HERA-LHC bench- 
mark of Sect. 13.3.31 Specifically, in these regions NNPDF uncertainties are generally larger than HI bands: 
the width of the uncertainty band for the HI fit varies much less between the data and extrapolation regions 
than that of the NNPDF bench. Also, the HI central value always falls within the NNPDF uncertainty band, 
but the NNPDF central value tends to fall outside the HI uncertainty band whenever the central values differ 
significantly. Figure [3T] suggests that this may be due to the greater flexibility of the functional form in the 
NNPDF fit. Specifically, the d quark distribution at large x does not become negative in the HI fit, because 
this behaviour is not allowed by the parameterization. 



4 DETERMINATION OF PARTON DISTRIBUTIONS 

4.1 Extraction of the proton PDFs from a combined fit of HI and ZEUS inclusive DIS cross sections 40 

4.1.1 Introduction 

The kinematics of lepton hadron scattering is described in terms of the variables Q 2 , the invariant mass of 
the exchanged vector boson, Bjorken x, the fraction of the momentum of the incoming nucleon taken by the 
struck quark (in the quark-parton model), and y which measures the energy transfer between the lepton and 
hadron systems. The differential cross-section for the neutral current (NC) process is given in terms of the 
structure functions by 

where Y± = 1 ± (1 — y) 2 . The structure functions F2 and xF% are directly related to quark distributions, 
and their Q 2 dependence, or scaling violation, is predicted by perturbative QCD. For low x, x < 10~ 2 , F2 
is sea quark dominated, but its Q 2 evolution is controlled by the gluon contribution, such that HERA data 
provide crucial information on low-x sea-quark and gluon distributions. At high Q 2 , the structure function 
XF3 becomes increasingly important, and gives information on valence quark distributions. The charged 
current (CC) interactions also enable us to separate the flavour of the valence distributions at high-x, since 
their (LO) cross-sections are given by, 

d 2 a(e + p) G\ML r . . . . 2 , , ., 

d 2 a(e~p) G 2 P ML r , . . , 9/ - .-, 

-w 2 = (g+^W [( " + c) + ( y) {d+s)] ■ 

Parton Density Function (PDF) determinations are usually obtained in global NLO QCD fits [168-170], 
which use fixed target DIS data as well as HERA data. In such analyses, the high statistics HERA NC e + p 
data have determined the low-x sea and gluon distributions, whereas the fixed target data have determined the 
valence distributions. Now that high-Q 2 HERA data on NC and CC e + p and e~p inclusive double differential 
cross-sections are available, PDF fits can be made to HERA data alone, since the HERA high Q 2 cross-section 
data can be used to determine the valence distributions. This has the advantage that it eliminates the need for 
heavy target collections, which must be applied to the z^-Fe and [iD fixed target data. Furthermore there is 
no need to assume isospin symmetry, i.e. that d in the proton is the same as u in the neutron, since the d 
distribution can be obtained directly from CC e + p data. 

The HI and ZEUS collaborations have both used their data to make PDF fits [170], [18]. Both of these 
data sets have very small statistical uncertainties, so that the contribution of systematic uncertainties becomes 
dominant and consideration of point to point correlations between systematic uncertainties is essential. The 
ZEUS analysis takes account of correlated experimental systematic errors by the Offset Method, whereas HI 
uses the Hessian method [171]. Whereas the resulting ZEUS and HI PDFs are compatible, the gluon PDFs 
have rather different shapes, see Fig [3U and the uncertainty bands spanned by these analyses are comparable 
to those of the global fits. 

It is possible to improve on this situation since ZEUS and HI are measuring the same physics in the 
same kinematic region. These data have been combined using a 'theory-free' Hessian fit in which the only as- 
sumption is that there is a true value of the cross-section, for each process, at each x, Q 2 point [172]. Thus each 
experiment has been calibrated to the other. This works well because the sources of systematic uncertainty in 
each experiment are rather different, such that all the systematic uncertainties are re-evaluated. The resulting 
correlated systematic uncertainties on each of the combined data points are significantly smaller than the sta- 
tistical eiTors. This combined data set has been used as the input to an NLO QCD PDF fit. The consistency of 
the input data set and its small systematic uncertainties enables us to calculate the experimental uncertainties 
on the PDFs using the \ 2 tolerance, A^ 2 = 1. This represents a further advantage compared to the global fit 
analyses where increased tolerances of Ax 2 = 50 - 100 are used to account for data inconsistencies. 



40 Contributing authors: A. Cooper-Sarkar, A. Glazov, G. Li for the Hl-ZEUS combination group. 




Fig. 32: HERAPDFs, xu v ,xd v ,xS, xg at Q 2 = lOGeV 2 . (Left) with experimental uncertainties evaluated as for the central fit (see 
text) and (right) with experimental uncertainties evaluated by accounting for the 47 systematic errors by the Hessian method. 



For the HERAPDF0.1 fit presented here, the role of correlated systematic uncertainties is no longer 
crucial since these uncertainties are relatively small. This ensures that similar results are obtained using either 
Offset or Hessian methods, or by simply combining statistical and systematic uncertainties in quadrature. The 
X 2 per degree of freedom for a Hessian fit is 553/562 and for a quadrature fit it is 428/562. For our central fit 
we have chosen to combine the 43 systematic uncertainties which result from the separate ZEUS and HI data 
sets in quadrature, and to Offset the 4 sources of uncertainty which result from the combination procedure. 
The x 2 P er degree of freedom for this fit is 477/562. This procedure results in the most conservative estimates 
on the resulting PDFs as illustrated in Fig. [32] which compares the PDFs and their experimental uncertainties 
as evaluated by the procedure of our central fit and as evaluated by treating the 47 systematic uncertainties by 
the Hessian method. 

Despite this conservative procedure, the experimental uncertainties on the resulting PDFs are impres- 
sively small and a thorough consideration of further uncertainties due to model assumptions is necessary. In 
Section 14.1.21 we briefly describe the data combination procedure. In Section l4.1.3l we describe the NLO QCD 
analysis and model assumptions. In Section 14.1.41 we give results. In Section l4.1.5l we give a summary of the 
fit results and specifications for release of the HERAPDF0.1 to LHAPDF. In Section 14.1.61 we investigate the 
predictions of the HERAPDF0.1 for W and Z cross-sections at the LHC. 



4.1.2 Data Combination 

The data combination is based on assumption that the HI and ZEUS experiments measure the same cross 
section at the same kinematic points. The systematic uncertainties of the measurements are separated, fol- 
lowing the prescription given by the HI and ZEUS, into point to point correlated sources aj and uncorrected 
systematic uncertainty, which is added to the statistical uncertainty in quadrature to result in total uncorrected 
uncertainty crj for each bin i. The correlated systematic sources are considered to be uncorrected between 
HI and ZEUS. All uncertainties are treated as multiplicative i.e. proportional to the central values, which is a 
good approximation for the measurement of the cross sections. 

A correlated probability distribution function for the physical cross sections M ! ' tmc and systematic 
uncertainties o^true for a single experiment corresponds to a x 2 function: 



M 



i,true 



M +2^7 7577: KIT- V" 
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where M % are the central values measured by the experiment, dM l /daj are the sensitivities to the correlated 
systematic uncertainties and a a are the uncertainties of the systematic sources. For more than one experiment, 
total Xtot can t> e represented as a sum of Xexp- The combination procedure allows to represent Xtot m tne 



following form: 



Xt 2 ot (M^\{3 htrVLe )= X l + Y J 



j\/fi,tiue I j\/fi,stve i 

dM l > ave M l ' true / o 



_l_ (/?j,true) 2 



M i,true y Y 



(51) 

Here the sum runs over a union set of the cross section bins. The value of the Xtot at th e minimum, Xg, quan- 
tifies consistency of the experiments. M !,ave are the average values of the cross sections and [3j correspond 
to the new systematic sources which can be obtained from the original sources ay through the action of an 
orthogonal matrix. In essence, the average of several data sets allows one to represent the total \ 2 in a form 
which is similar to that corresponding to a single data set, Eq.|50l but with modified systematic sources. 

The combination is applied to NC and CC cross section data taken with e + and e~ beams simultaneously 
to take into account correlation of the systematic uncertainties. The data taken with proton beam energies of 
E p = 820 GeV and E p = 920 GeV are combined together for inelasticity y < 0.35, for this a small center 
of mass energy correction is applied. For the combined data set there are 596 data points and 43 experimental 
systematic sources. The Xo/dof = 510/599 is below 1, which indicates conservative estimation of the 
uncorrected systematics. 

Besides the experimental uncertainties, four additional sources related to the assumptions made for 
the systematic uncertainties are considered. Two of the extra sources deal with correlation of the HI and 
ZEUS data for estimation of the photoproduction background and simulation of hadronic energy scale. These 
sources introduce additional ~ 1% uncertainty for y > 0.6 and y < 0.02 data. The third source covers 
uncertainty arising from the center of mass correction by varying Fl = F® CD to Fl = 0. The resulting 
uncertainty reaches few per mille level for y ~ 0.35. Finally, some of the systematic uncertainties, for 
example background subtraction, may not be necessary multiplicative but rather additive, independent of the 
cross section central values. The effect of additive assumption for the errors is evaluated by comparing the 
average obtained using Eq. [50]and an average in which M !,tmc /M !,ave scaling is removed for all but global 
normalization errors. 



4.1.3 QCD Analysis 

The QCD predictions for the structure functions are obtained by solving the DGLAP evolution equations [102, 
104, 105] at NLO in the MS scheme with the renormalisation and factorization scales chosen to be Q 2 41 . The 
DGLAP equations yield the PDFs at all values of Q 2 provided they are input as functions of x at some input 
scale Qq. This scale has been chosen to be Q\ = 4GeV 2 and variation of this choice is considered as one 
of the model uncertainties. The resulting PDFs are then convoluted with NLO coefficient functions to give 
the structure functions which enter into the expressions for the cross-sections. The choice of the heavy quark 
masses is, m c = 1.4, = 4.75GeV, and variation of these choices is included in the model uncertainties. For 
this preliminary analysis, the heavy quark coefficient functions have been calculated in the zero-mass variable 
flavour number scheme. The strong coupling constant was fixed to a: s (M§) = 0.1176 [167], and variations in 
this value of ±0.002 have also been considered. 

The fit is made at leading twist. The HERA data have a minimum invariant mass of the hadronic system, 
W 2 , of W^ in = 300 GeV 2 and a maximum x, x max = 0.65, such that they are in a kinematic region where 
there is no sensitivity to target mass and large-x higher twist contributions. However a minimum Q 2 cut 
is imposed to remain in the kinematic region where perturbative QCD should be applicable. This has been 
chosen to be Q 2 min = 3.5 GeV 2 . Variation of this cut is included as one of the model uncertainties. 

A further model uncertainty is the choice of the initial parameterization at Qq. Three types of pa- 
rameterization have been considered. For each of these choices the PDFs are parameterized by the generic 
form 

xf(x) = Ax B {l - x) c (l + Dx + Ex 2 + Fx 3 ), (52) 
41 The programme QCDNUM [163] has been used and checked against the programme QCDfit [173]. 



and the number of parameters is chosen by 'saturation of the \ 2 \ such that parameters D, E, F are only varied 
if this brings significant improvement to the x 2 - Otherwise they are set to zero. 

The first parameterization considered follows that used by the ZEUS collaboration. The PDFs for u 
valence, xu v (x), d valence, xd v (x), total sea, xS(x), the gluon, xg(x), and the difference between the d and 
u contributions to the sea, xA(x) = x(d — u), are parameterized. 

xu v {x) = A uv x Bw {\ - x) C ™(l + D uv x + E uv x 2 ) 

xd v (x) = A dv x Bdv (l - x) Cdv 

xS(x) = A s x Bs (l-x) Cs 

xg(x) = A g x B »(l - x)° 3 (l + D g x) 

xA(x) = A a x Ba (1 - xf A 

The total sea is given by, xS = 2x{u + d + s + c + b), where q = q sea for each flavour, u = u v + u sea , d = 
d v + d sea and q = q sea for all other flavours. There is no information on the shape of the xA distribution 
in a fit to HERA data alone and so this distribution has its parameters fixed, such that its shape is consistent 
with Drell-Yan data and its normalization is consistent with the size of the Gottfried sum-rule violation. A 
suppression of the strange sea with respect to the non-strange sea of a factor of 2 at Qq, is imposed consistent 
with neutrino induced dimuon data from NuTeV. The normalisation parameters, A uv , Ad v , A g , are constrained 
to impose the number sum-rules and momentum sum-rule. The B parameters, B uv and B^ v are set equal, since 
there is no information to constrain any difference. Finally this ZEUS -style parameterization has eleven free 
parameters. 

The second parameterization considered follows that of the HI Collaboration The choice of quark PDFs 
which are parameterized is different. The quarks are considered as u-type and d-type, xU = x(u v + u sea + c), 
xD = x(d v + d sea + s), xU = x(u + c) and xD = x(d + s), assuming q sea = q, as usual. These four 
(anti-)quark distributions are parameterized separately. 

xU(x) = A uX Bu (1 - x) Cu (1 + D v x + Eux 2 + F uX 3 ) 

xD(x) = A d x Bd (1 - xf D (l + D D x) 



xU(x) 


= A D x B u(l 


- x) c ° 


xD(x) 


= A B x B °(l 


- X) C D 


xg(x) 


= A g x B *(l- 


- x)° 3 



Since the valence distributions must vanish as x — ► 0, the parameters, A and B are set equal for xU and xU; 
Ajj = A v , Bjj = By; and for xD and xD; Ad = A D , Bp = B D . Since there is no information on the 
flavour structure of the sea it is also necessary to set B v = B D , such that there is a single B parameter for all 
four quark distributions. The normalisation, A g , of the gluon is determined from the momentum sum-rule and 
the parameters Djj and Djj are determined by the number sum-rules. Assuming that the strange and charm 
quark distributions can be expressed as x independent fractions, f s = 0.33 and f c = 0.15, of the d and u type 
sea respectively, gives the further constraint A v = A D (1 — f s ) /(l — f c ), which ensures that u = d at low x. 
Finally this HI -style parameterization has 10 free parameters. 

The third parameterization we have considered combines the best features of the previous two. It 
has less model dependence than the ZEUS -style parameterization in that it makes fewer assumptions on the 
form of sea quark asymmetry xA, and it has less model dependence than the Hl-style parameterization in 
that it does not assume equality of all B parameters. Furthermore, although all types of parameterization give 
acceptable x 2 values, the third parameterization has the best \ 2 an d it gives the most conservative experimental 
errors. This is the parameterization which we chose for our central fit. The PDFs which are parameterized are 
xu v , xd v , xg and xU, xD. 

xu v (x) = A uv x Buv (l - x) Cuv (l + D uv x + E uv x 2 ) 




Fig. 33 : HERAPDFs, xu v ,xd v ,xS,xg and their uncertainties at Q 2 = lOGeV 2 . (Left) for the central fit; (centre) for the ZEUS-style 
parameterization; (right) for the Hl-style parameterization 



Model variation 


Standard value 


Upper Limit 


Lower limit 


m c 


1.4 


1.35 


1.5 


m b 


4.75 


4.3 


5.0 


O 2 


3.5 


2.5 


5.0 


Ql 


4.0 


2.0 


6.0 


fs 


0.33 


0.25 


0.40 


fc 


0.15 


0.12 


0.18 



Table 7: Standard values of input parameters and cuts, and the variations considered to evaluate model uncertainty 

xd v (x) = A dv x Bdv (l - x) Cdv 
xU(x) = A v x B o{l - x fu 
xD(x) = A 5 x b d(1 - x) c d 
xg(x) = A g x Bs (l - x) Cg 

The normalisation parameters, A uv , A dv , A g , are constrained to impose the number sum-rules and momentum 
sum-rule. The B parameters, B uv and Bd v are set equal, B uv = Bd v and the B parameters B(j and Bq are 
also set equal, Bp = Bq, such that there is a single B parameter for the valence and another different single B 
parameter for the sea distributions. Assuming that the strange and charm quark distributions can be expressed 
as x independent fractions, f s = 0.33 and f c = 0.15, of the d and u type sea, gives the further constraint 
Ajj = Ajj(l — / s )/(l — fc)- The value of f s = 0.33 has been chosen to be consistent with determinations 
of this fraction using neutrino induced di-muon production. This value has been varied to evaluate model 
uncertainties. The charm fraction has been set to be consistent with dynamic generation of charm from the 
start point of Q 2 = m 2 , in a zero-mass-variable-flavour-number scheme. A small variation of the value of f c 
is included in the model uncertainties. Finally this parameterization has 1 1 free parameters. 

It is well known that the choice of parameterization can affect both PDF shapes and the size of the PDF 
uncertainties. Fig[33]compares the PDFs and their uncertainties as evaluated using these three different param- 
eterizations. As mentioned earlier, the third parameterization results in the most conservative uncertainties. 

We present results for the HERA PDFs based on the third type of parameterization, including six sources 
of model uncertainty as specified in Table |7] We also compare to results obtained by varying a s (M|) and by 
varying the choice of parameterization to those of the ZEUS and the HI styles of parameterization. 

4.1.4 Results 

In Fig. [34] we show the HERAPDF0. 1 superimposed on the combined data set for NC data and CC data. In 
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Fig. 34: HERA combined NC (left) and CC (right) data. The predictions of the HERAPDF0.1 fit are superimposed. The uncertainty 
bands illustrated derive from both experimental and model sources 




Fig. 35: Left: HERA combined NC data at low Q 2 . Right: the NC reduced cross-section vs Q 2 for three x-bins. The predictions of 
the HERAPDFO. 1 fit are superimposed, together with the predictions of the ZEUS-JETS and H1PDF2000 PDFs 



Fig [35] we show the NC data at low Q 2 , and we illustrate scaling violation by showing the reduced cross- 
section vs. Q 2 for a few representative x bins. The predictions of the HERAPDFO. 1 fit are superimposed, 
together with the predictions of the ZEUS-JETS and H1PDF2000 PDFs. 

Fig. [36] shows the HERAPDFO. 1 PDFs, xu v ,xd V: xS,xg, as a function of x at the starting scale 
Q 2 = 4 GeV 2 and at Q 2 = 10 GeV 2 . Fig. [37] shows the same PDFs at the scales Q 2 = 100, 10000 GeV 2 . 
Fractional uncertainty bands are shown beneath each PDF. The experimental and model uncertainties are 
shown separately. As the PDFs evolve with Q 2 the total uncertainty becomes impressively small. 

The total uncertainty of the PDFs obtained from the HERA combined data set is much reduced com- 
pared to the PDFs extracted from the analyses of the separate HI and ZEUS data sets, as can be seen from 
the summary plot Fig. [38] where these new HERAPDFO. 1 PDFs are compared to the ZEUS-JETS and 
H1PDF2000 PDFs. It is also interesting to compare the present HERAPDFO. 1 analysis of the combined 
HERA-I data set with an analysis of the separate data sets which uses the same parameterization and as- 
sumptions. Fig [39] makes this comparison. It is clear that it is the data combination, and not the choice of 
parameterization and assumptions, which has resulted in reduced uncertainties for the low-x gluon and sea 
PDFs. 

The break-up of the HERAPDFs into different flavours is illustrated in Fig. [40] where the PDFs xll, 




Fig. 36: HERAPDFs, xu v ,xd v ,xS,xg, at (left) Q 2 — 4 GeV 2 and (right) Q 2 — 10 GeV 2 . Fractional uncertainty bands are shown 
beneath each PDF. The experimental and model uncertainties are shown separately as the red and yellow bands respectively 
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Fig. 37: HERAPDFs, xu v , xd v ,xS, xg, at (left) Q 2 = 100 GeV 2 and (right) Q 2 = 10000 GeV 2 . Fractional uncertainty bands are 
shown beneath each PDF. The experimental and model uncertainties are shown separately as the red and yellow bands respectively 
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Fig. 38: Left: PDFs from the ZEUS-JETS and H1PDF2000 PDF separate analyses of ZEUS and HI. Right: HERAPDFO. 1 PDFs 
from the analysis of the combined data set 




Fig. 39: Left: PDFs resulting from an analysis of the HI and ZEUS separate data sets using the same parameterization and assumptions 
as HERAPDF0.1. Right: HERAPDFO. 1 PDFs from the analysis of the combined data set (experimental uncertainties only) 
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Fig. 40: HERAPDFs at Q 2 = lOGeV 2 : (left) xU, xD,xU ,xD; (right) xu,xd Fractional uncertainty bands are shown 

beneath each PDF. The experimental and model uncertainties are shown separately as the red and yellow bands respectively 

xD, xU, xD and xu,xd,xc,xs are shown at Q 2 = 10 GeV 2 . The model uncertainty on these PDFs from 
variation of Q 2 min , Qq, m c and mj is modest. The model uncertainty from variation of f s and f c is also modest 
except for its obvious effect on the charm and strange quark distributions. 

It is also interesting to look at the results obtained from using the ZEUS-style and HI style param- 
eterizations described in Section 14.1.31 In Fig. @T] these alternative parameterizations are shown as a blue 
line superimposed on the HERAPDFO. 1 PDFs. These variations in parameterization produce changes in the 
resulting PDFs which are comparable to the experimental uncertainties in the measured kinematic range. A 
further variation of parameterization originates from the fact that, if the D parameter for the gluon is allowed 
to be non-zero, then each type of parameterization yields a double minimum in x 2 sucn that the gluon may 
take a smooth or a 'humpy' shape. Although the lower x 2 is obtained for the for the smooth shape, the \ 2 
for the 'humpy' shape is still acceptable. The PDFs for the 'humpy' version of our chosen form of parame- 
terization are compared to the standard version in Fig. [42l where they are shown as a blue line superimposed 
on the HERAPDFO. 1 PDFs. This comparison is shown at Q 2 = 4GeV 2 , where the difference is the greatest. 
Nevertheless the resulting PDFs are comparable to those of the standard choice. This explains a long-standing 
disagreement in the shape of the gluon obtained by the separate ZEUS-JETS and H1PDF200 analyses. The 
ZEUS data favoured the smooth shape and the HI data favoured the 'humpy' shape. However the precision 
of the combined data set results in PDFs for these shapes which are not significantly different in the measured 
kinematic region. 




Fig. 41: HERAPDFs at Q 2 = lOGeV 2 : with the results for the ZEUS-style parameterization (left) and for the HI -style parameteriza- 
tion (right) superimposed as a blue line. 




Fig. 42: HERAPDFs at Q 2 = 4GeV 2 : with the results for the humpy version superimposed as a blue line. 
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Fig. 43: HERAPDFs at Q 2 = lOGeV 2 : with the results for q s (M|) = 0.1156 (left) and for a a {M%) = 0.1196 (right) superimposed 
as a blue line. 




Fig. 44: HERAPDFs at Q 2 = lOGeV 2 compared to the PDFs from CTEQ6.1 and MRST01 



It is also interesting to compare the PDFs for the standard choice to those obtained with a different input 
value of a s (M|). The uncertainty on the current PDG value of a s (M^) is ±0.002 and thus we vary our central 
choice by this amount. The results are shown in Fig.|43l where we can see that this variation only affects the 
gluon PDF, such that the larger(smaller) value of o: s (M§) results in a harder(softer) gluon as predicted by the 
DGLAP equations. The change is outside total uncertainty bands of the standard fit. Finally, Figs. [44] and 
[45] compare the HERAPDF0. 1 PDFs to those of the CTEQ and the MRST/MSTW groups respectively. The 
uncertainty bands of the CTEQ and MRST/MSTW analyses have been scaled to represent 68% CL limits for 
direct comparability to the HERAPDF0. 1. The HERAPDF0.1 analysis has much improved precision on the 
low-x gluon. 

4.1.5 Summary of HERAPDFO. 1 results 

Now that high-Q 2 HERA data on NC and CC e + p and e~p inclusive double differential cross-sections are 
available, PDF fits can be made to HERA data alone, since the HERA high Q 2 cross-section data can be 
used to determine the valence distributions and HERA low Q 2 cross-section data can be used to determine 
the Sea and gluon distributions. The combined HERA-I data set, of neutral and charged current inclusive 
cross-sections for e + p and e~p scattering, has been used as the sole input for an NLO QCD PDF fit in the 
DGLAP formalism. The consistent treatment of systematic uncertainties in the joint data set ensures that 
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Fig. 45: HERAPDFs at Q 2 = lOGeV 2 compared to the PDFs from CTEQ6.5 and MSTW08(prel.) 



experimental uncertainties on the PDFs can be calculated without need for an increased x 2 tolerance. This 
results in PDFs with greatly reduced experimental uncertainties compared to the separate analyses of the ZEUS 
and HI experiments. Model uncertainties, including those arising from parameterization dependence, have 
also been carefully considered. The resulting HERAPDFs (called HERAPDFO. 1) have improved precision at 
low-x compared to the global fits, this will be important for predictions of the W and Z cross-sections at the 
LHC, as explored in the next Section. 

These PDFs have been released on LHAPDF in version LHAPDF.5.6: they consist of a central value 
and 22 experimental eigenvectors plus 12 model alternatives. The user should sum over Nmem=l,22 for 
experimental uncertainties and over Nmem=l,34 for total uncertainties. 



4.1.6 Predictions for W and Z cross-sections at the LHC using the HERAPDFO. 1 

At leading order (LO), W and Z production occur by the process, qq — ► W/Z, and the momentum fractions 
of the partons participating in this subprocess are given by, xx,2 = ^exp(±y), where M is the centre of mass 
energy of the subprocess, M = My/ or Mz, \/s is the centre of mass energy of the reaction (^/s = 14 TeV 
at the LHC) and y = ^In ^^jj gives the parton rapidity. The kinematic plane for LHC parton kinematics 
is shown in Fig. |46l Thus, at central rapidity, the participating partons have small momentum fractions, 
x ~ 0.005. Moving away from central rapidity sends one parton to lower x and one to higher x, but over the 
central rapidity range, \y\ < 2.5, x values remain in the range, 5 x 10~ 4 <i<5x 10~ 2 . Thus, in contrast 
to the situation at the Tevatron, the scattering is happening mainly between sea quarks. Furthermore, the high 
scale of the process Q 2 = M 2 ~ 10, 000 GeV 2 ensures that the gluon is the dominant parton, see Fig. [46j 
so that these sea quarks have mostly been generated by the flavour blind g — > qq splitting process. Thus the 
precision of our knowledge of W and Z cross-sections at the LHC is crucially dependent on the uncertainty 
on the momentum distribution of the low-x gluon. 

HERA data have already dramatically improved our knowledge of the low-x gluon, as discussed in ear- 
lier proceedings of the HERALHC workshop [1]. Now that the precision of HERA data at small-x have been 
dramatically improved by the combination of HI and ZEUS HERA-I data, we re-investigate the consequences 
for predictions of W, Z production at the LHC. 

Predictions for the W/Z cross-sections, decaying to the lepton decay mode, using CTEQ, ZEUS PDFs 
and the HERAPDFO. 1 are summarised in Table [8] Note that the uncertainties of CTEQ PDFS have been 
rescaled to represent 68% CL, in order to be comparable to the HERA PDF uncertainties. The precision on 
the predictions of the global fits (CTEQ6.1/5 and ZEUS-2002) for the total W/Z cross-sections is ~ 3% at 
68% CL. The precision of the ZEUS-2005 PDF fit prediction, which used only ZEUS data, is comparable, 
since information on the low-x gluon is coming from HERA data alone. The increased precision of the 
HERAPDF0.1 low-rc gluon PDF results in increased precision of the W/Z cross-section predictions of ~ 1%. 
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Fig. 46: Left plot: The LHC kinematic plane (thanks to James Stirling). Right plot: Typical PDF distributions at Q 2 = 10, 000 GeV 2 . 



PDF Set 


a{W+).B(W + - 


>l + vt) a(W-).B(W- 


■*l-ih) a(Z).B(Z 


CTEQ6.1 


11.61 ± 0.34 nb 


8.54 ± 0.26 nb 


1.89 ± 0.05 nb 


CTEQ6.5 


12.47 ± 0.28 nb 


9.14 ± 0.22 nb 


2.03 ± 0.04 nb 


ZEUS-2002 


12.07 ± 0.41 nb 


8.76 ± 0.30 nb 


1.89 ± 0.06 nb 


ZEUS-2005 


11.87 ± 0.45 nb 


8.74 ± 0.31 nb 


1.97 ± 0.06 nb 


HERAPDF0. 1 


12.14 ± 0.13 nb 


9.08 ± 0.14 nb 


1.99 ± 0.025 nb 



Table 8: LHC W/Z cross-sections for decay via the lepton mode, for various PDFs, with 68% CL uncertainties. 

It is interesting to consider the predictions as a function of rapidity. Fig [47] shows the predictions 
for W + ,W~ , Z production as a function of rapidity from the HERAPDF0.1 PDF fit and compares them 
to the predictions from a PDF fit, using the same parameterization and assumptions, to the HI and ZEUS 
data from HERA-I uncombined. The increase precision due to the combination is impressive. Fig. [48] show 
the predictions for W + , W~, Z production as a function of rapidity from the CTEQ6. 1, 6.6 and MRST01 
PDF fits for comparison. The uncertainties on the CTEQ and MRST PDF predictions have been rescaled to 
represent 68% CL limits, for direct comparability to the HERAPDF0.1 uncertainties. At central rapidity these 
limits give an uncertainty on the boson cross-sections of ~ 5%, (~ 3%),(~ 2%) for CTEQ6.1, (CTEQ6.6), 
(MRST01) compared to ~ 1% for the HERAPDF0. 1 . 

So far, only experimental uncertainties have been included in these evaluations. It is also necessary to 
include model uncertainties. Fig. [49] shows the W + , W~ , Z rapidity distributions including the six sources 
of model uncertainty detailed in Section 14.1.31 These model uncertainties increase the total uncertainty at 
central rapidity to ~ 2%. Further uncertainty due to the choice of a s (Mz) is small because, although a lower 
(higher) choice results in a larger (smaller) gluon at low x, the rate of QCD evolution is lower (higher) and this 
largely compensates. Uncertainties due to the choice of parameterization also have little impact on the boson 
rapidity spectra in the central region as illustrated in Fig. [49]by the superimposed blue line, which represents 
the alternative 'humpy' gluon parameterization (see Sec. 14.1.4"] ). 

Since the PDF uncertainty feeding into the W + ,W~ and Z production is mostly coming from the 
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Fig. 47: The W + , W , Z rapidity distributions, Aw and Rzw (see text) and their uncertainties as predicted by (left) HERAPDFO. 1 
(right) a similar fit to the uncombined ZEUS and HI data from HERA-I. 
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Fig. 48: The W + , W , Z rapidity distributions, Aw and Rzw (see text) and their uncertainties (scaled to 68% CL) as predicted by 
(left) CTEQ6.1, (middle) CTEQ6.6, right (MRST01 
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Fig. 49: Left: the W + , W~ , Z rapidity distributions, Aw, and Rzw (see text) and their experimental uncertainties (red) and model 
uncertainties (yellow). Right: the l + , l~ rapidity distributions, Ai and Rzi (see text) and their experimental and model uncertainties. 
The superimposed blue line represents the results of the alternative 'humpy' gluon parameterization. 

gluon PDF, for all three processes, there is a strong correlation in their uncertainties, which can be removed 
by taking ratios. Figs. [47l[48] and [49] also show the W asymmetry 

A w = (W + - W~)/(W + + W~). 

The experimental PDF uncertainty on the asymmetry is larger (~ 5% for both CTEQ and HERAPDFs, ~ 7% 
for the MRST01 PDFs) than that on the individual distributions and the variation between PDF sets is also 
larger - compare the central values of the CTEQ and MRST predictions, which are almost 25% discrepant. 
This is because the asymmetry is sensitive to the difference in the valence PDFs, u v — d v , in the low-x region, 
5 x 10 -4 <i<5x 10 -2 , where there is no constraint from current data. To see this consider that at LO, 

A\y ~ (ud — du) j (ud + du + cs + sc) 

and that d ~ u at low-x. (Note that the cs and sc contributions cancel out in the numerator). The discrepancy 
between the CTEQ and MRST01 asymmetry predictions at y = can be quantitatively understood by consid- 
ering their different valence PDFs (see Figs.@4j|45]in Sec. l4.1.4"T >. In fact a measurement of the asymmetry at 
the LHC will provide new information to constrain these PDFs. 

By contrast, the ratio 

Rzw = Z/{W + + W~), 

also shown in Figs. |47j|48] and |49j has very small PDF uncertainties (both experimental and model) and there 
is no significant variation between PDF sets. To understand this consider that at LO 

Rzw — (uu + dd + cc + ss) j (ud + du + cs + sc) 

(modulo electroweak couplings) and that d ~ u at low-x 42 . This will be a crucial measurement for our 
understanding of Standard Model Physics at the LHC. 

However, whereas the Z rapidity distribution can be fully reconstructed from its decay leptons, this is 
not possible for the W rapidity distribution, because the leptonic decay channels which we use to identify the 
W's have missing neutrinos. Thus we actually measure the W's decay lepton rapidity spectra rather than the 

42 There is some small model dependence from the strange sea fraction accounted for in both HERAPDFO. 1 and in CTEQ6.6 PDFs. 



W rapidity spectra. Fig. |49] also shows the rapidity spectra for positive and negative leptons from W + and 
W~ decay, the lepton asymmetry, 

A l = (i + -i-)/(i+ + r) 

and the ratio 

Rzi = z/(i + + r) 

A cut of, p t i > 25 GeV, has been applied on the decay lepton, since it will not be possible to trigger on leptons 
with small p t i. A particular lepton rapidity can be fed from a range of W rapidities so that the contributions 
of partons at different x values is smeared out in the lepton spectra, but the broad features of the W spectra 
remain. 

In summary, these investigations indicate that PDF uncertainties, deriving from experimental error, on 
predictions for the W, Z rapidity spectra in the central region, have reached a precision of ~ 1%, due to the 
input of the combined HERA-I data. This level of precision is maintained when using the leptons from the 
W decay and gives us hope that we could use these processes as luminosity monitors 43 . However, model 
dependent uncertainties must now be considered very carefully. The current study will be repeated using a 
general-mass variable-flavour scheme for heavy quarks. 

The predicted precision on the ratios Rzw> Rzi is even better since model uncertainties are also very 
small giving a total uncertainty of ~ 1%. This measurement may be used as a SM benchmark. However the 
W and lepton asymmetries have larger uncertainties (5 — 7%). A measurement of these quantities would give 
new information on valence distributions at small-x. 



4.2 Measurements of the Proton Structure Function Fl at HERA 44 

4. 2. 1 Introduction 

The inclusive deep inelastic ep scattering (DIS) cross section can at low Q 2 be written in terms of the two 
structure functions, F2 and Fl, in reduced form as 

where Q 2 = —q 2 is the negative of the square of the four-momentum transferred between the electron 45 and 
the proton, and x = Q 2 /2qP denotes the Bjorken variable, where P is the four-momentum of the proton. The 
two variables are related through the inelasticity of the scattering process, y = Q 2 /sx, where s = AE e E p 
is the centre-of-mass energy squared determined from the electron and proton beam energies, E e and EL. In 
eq.l53l a denotes the fine structure constant and Y+ = 1 + (1 — y) 2 . 

The two proton structure functions F 2 and Fl are related to the cross sections of the transversely and 
longitudinally polarised virtual photons interacting with protons, ol and ar, according to Fl oc o~l and 
F 2 oc (a L + 0" T ). Therefore the relation < F L < F 2 holds. In the Quark Parton Model (QPM), F 2 is 
the sum of the quark and anti-quark x distributions, weighted by the square of the electric quark charges, 
whereas the value of Fl is zero [174]. The latter follows from the fact that a quark with spin \ cannot absorb 
a longitudinally polarised photon. 

In Quantum Chromodynamics (QCD), Fl differs from zero, receiving contributions from quarks and 
from gluons [175]. At low x and in the Q 2 region of deep inelastic scattering the gluon contribution greatly 
exceeds the quark contribution. Therefore Fl is a direct measure of the gluon distribution to a very good 
approximation. The gluon distribution is also constrained by the scaling violations of F 2 as described by 
the DGLAP QCD evolution equations [102-105, 176]. An independent measurement of Fl at HERA, and its 
comparison with predictions derived from the gluon distribution extracted from the Q 2 evolution of F 2 (x, Q 2 ), 



43 A caveat is that the current study has been performed using PDF sets which are extracted using NLO QCD in the DGLAP 
formalism. The extension to NNLO gives small corrections ~ 1%. However, there may be much larger uncertainties in the theoretical 
calculations because the kinematic region involves low-j;. There may be a need to account for ln(l/x) resummation or high gluon 
density effects. 

44 Contributing authors: J. Grebenyuk, V. Lendermann 

45 The term electron is used here to denote both electrons and positrons unless the charge state is specified explicitly. 



thus represents a crucial test on the validity of perturbative QCD (pQCD) at low x. Moreover, depending on 
the particular theoretical approach adopted, whether it be a fixed order pQCD calculation, a re-summation 
scheme, or a color dipole ansatz, there appear to be significant differences in the predicted magnitude of Fl at 
low Q 2 . A measurement of Fl may be able to distinguish between these approaches. 

Previously the structure function Fl was extracted by the HI collaboration from inclusive data at high 
y using indirect methods, as discussed in Sect. l4.2.2l A preliminary measurement was also presented by the 
ZEUS collaboration using initial state radiation (ISR) events [177], although the precision of this measurement 
was limited. 

To make a direct measurement of Fl, reduced cross sections must be measured at the same x and Q 2 
but with different y values. This can be seen from eq.[53] which states that Fl(x, Q 2 ) is equal to the partial 
derivative da r (x, Q 2 , y) / d(y 2 /Y + ). Due to the relationship y = Q 2 /xs this requires data to be collected at 
different beam-beam centre-of-mass energies, which was done in the last year of HERA running. To maximize 
the precision of this procedure, the measurable range of y 2 /Y + had to be maximised for each fixed x and Q 2 . 
This was achieved by operating HERA at the lowest attainable centre-of-mass energy and by measuring this 
data up to the highest possible value of y. An intermediate HERA centre-of-mass energy was also chosen, to 
improve the precision of Fl extraction and to act as a consistency check. More specifically, between March 
and June 2007, HERA was operated with proton beam energies, E p = 460 GeV and 575 GeV, compared to the 
previous nominal value of 920 GeV. The electron beam energy was unaltered at E e = 27.6 GeV. Thus, three 
data sets, referred to the high- (HER), middle- (MER) and low-energy running (LER) samples, were collected 
with yfs =318 GeV, 25 1 GeV and 225 GeV, respectively. The integrated luminosities of the data sets used 
by ZEUS (HI) to measure F L are 32.8 (21.6) pb" 1 for HER, 6(6.2)pb- x for MER and 14 (12.4) pb" 1 for 
LER. The specific issues of the recent HI and ZEUS analyses are discussed in Sect. l4.2.3l and the results are 
presented in Sect. l4.2.4l 



4.2.2 Indirect Fl Extraction by HI 

HI extracted Fl from inclusive data using several indirect methods, which exploit the turn over of the reduced 
cross section at high y due to the Fl contribution. The basic principle is the following. First, the reduced 
neutral current cross section a T is measured in a y range, where the Fl contribution is negligible and thus the 
relation a r = F2 holds very well. Afterward, based on some theoretical assumption, the knowledge of F2 is 
extrapolated towards high y. Finally Fl is extracted from the difference between the prediction for Fi and the 
measurement of a r at high y. 

In the analyses at Q 2 > lOGeV 2 [18, 89, 178] the "extrapolation" method is used. In this method, an 
NLO QCD PDF fit to HI HERA I data is performed at y < 0.35, and the results are extrapolated to higher y 
using the DGLAP evolution equations. Fl is then extracted at a fixed y =0.75 and at Q 2 up to 700 GeV 2 
using eq.[53] The extracted values are shown in Fig.|50]for the high-Q 2 analysis [18]. 

At low Q 2 , extrapolations of DGLAP fits become uncertain. For Q 2 < 2 GeV 2 , as the strong coupling 
constant a s (Q 2 ) increases, the higher order corrections to the perturbative expansion become large and lead to 
the breakdown of the pQCD calculations. Therefore other methods are used in the HI low-Q 2 data analyses. 

The "shape method", as used in the last HI low-Q 2 study of HERA I data [179], exploits the shape of 
o> in a given Q 2 bin. The Q 2 dependence at high y is driven by the kinematic factor y 2 /Y + (eq.[53l. and to 
a lesser extent by Fl(x,Q 2 ). On the other hand, the gluon dominance at low x suggests that Fl may exhibit 
an x dependence similar to F%. Therefore it is assumed that Fl is proportional to F2 and the coefficient of 
proportionality depends only on Q 2 . In the extraction procedure one uses the ratio R of the cross sections of 
the transversely and longitudinally polarised photons 

a L F 2 - F L 

which is thus assumed to depend only on Q 2 . The reduced cross section is fitted by 

y 2 R(Q 2 ) 
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Fig. 50: Fl determined indirectly by HI at a fixed y = 0.75 and high Q 2 is shown as a function of Q 2 (lower scale) or equivalently 
x (upper scale) for e + p (closed circles) and e~p (open circles) data. The inner error bar represents the statistical error, and the outer 
error bar also includes the systematic error and the uncertainty arising from the extrapolation of F2 . 
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Fig. 51: Q 2 dependence of Fl(x, Q 2 ) at fixed y —0.75, extracted from the preliminary HI low-Q 2 data. The solid line shows the 
prediction of the fractal fit with a constant R. 



where some phenomenological model for F2 is chosen. 

An example of such an extraction using a fractal fit for F2 [180] is shown in Fig.[5TJ where preliminary 
HI results [179] for Fl at y =0.75 in the range of 0.35 < Q 2 < 8.5 GeV 2 are presented. The data favour a 
positive, not small Fl at low Q 2 . A drawback of this method is that it reveals a considerable dependence of R 
on the choice of the F2 model. 

In the derivative method [89, 179], Fl is extracted from the partial derivative of the reduced cross section 
on y at fixed Q 2 
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Fig. 52: Structure function Fl extracted by HI using the derivative method. The solid line shows the prediction of the fractal fit with a 
constant R. The inner error bars represent statistical uncertainties, the outer error bars represent statistical and systematic uncertainties 
added in quadrature. The solid (yellow) band indicates the model uncertainty. 

which is dominated by the Fl -dependent term at high y. The term proportional to OFl/Ox is negligible 
for moderately varying parametrisations of Fl. For low Q 2 values the rise of F2 is weak. The change 
of the term xdFijdx for the two assumptions: no rise at low x, i.e. dFildx = 0, and F2 oc x~ x is 
numerically significantly smaller than the experimental precision for da r /dlny. Therefore the derivative 
methods provides a means for determining Fl at low Q 2 with minimal phenomenological assumption. On the 
other hand, the errors obtained with the derivative method turn out to be significantly larger than those from 
the shape method. 

The preliminary results of Fl extraction from HI HERA I data [179] are presented in Fig. [52] The 
residual dependence of the measurement on the assumption made for F2 is estimated by a comparison with 
results obtained assuming an F2 which is flat in y. The lower bound on Fl obtained this way is depicted as a 
solid band in the figure. 

4.2.3 Details of Direct Fl Measurements 

The HI and ZEUS analysis procedures involve a measurement of the inclusive cross section at y > 0. 1. In this 
range, the kinematic variables x, y and Q 2 are most accurately reconstructed using the polar angle, 6 e , and the 
energy, E' e , of the scattered electron according to 



Reaching the high y values necessary for the Fl determination requires a measurement of the scattered elec- 
tron with energy down to a few GeV. The electron candidate is selected as an isolated electromagnetic energy 
deposition (cluster) in a calorimeter. The crucial analysis issue at high-y region is the identification of the scat- 
tered electron, and the estimation of the hadronic background which occurs when a particle from the hadronic 
final state mimics the electron signal. Most of background events are photoproduction (72?) events with Q 2 « 
in which the final state electron is scattered at low angles (high 9) 46 and thus escapes through the beam pipe. 

The 7p background suppression is performed in several steps. Firstly, calorimeter shower estimators are 
utilised which exploit the different profiles of electromagnetic and hadronic showers. Secondly, background 

46 The z axis of the right-handed coordinate systems used by HI and ZEUS is defined by the direction of the incident proton beam 
with the origin at the nominal ep interaction vertex. Consequently, small scattering angles of the final state particles correspond to 
large polar angles in the coordinate system. 



-1 E 'e ■ 2®e 

y = 1 — —sin — 

Ee 2 




(57) 



ZEUS 



f 3500 
§•3000 
2500 
2000 
1500 
1000 
500 




5 10 15 20 25 30 35 40 
E e (GeV) 



» 

05000 

111 

4000 








3000 








2000 








1000 









30 35 40 45 50 55 60 65 70 75 80 
E-P z (GeV) 




20 40 60 80 100120140160181 

M°) 




• ZEUS (prel.) 

Vs=252 GeV (6pb ') 
I I MC DIS + 7P 
I I MCyp 



Fig. 53: Comparison of 575 GeV data with the sum of DIS and background simulations for the energy of the scattered electron, total 
E — p z , theta of the scattered electron, angle of the hadronic final state and z coordinate of the vertex. The dotted lines indicate the 
cuts applied. 



coming from neutral particles, such as ttq, can be rejected by requiring a track associated to the electron 
candidate. Furthermore, jp events are suppressed by utilising the energy-momentum conservation. For that, 
the variable E — p z = T,i(Ei — p z ^i) is exploited, where the sum runs over energies Ei and longitudinal 
momentum components p z ^ of all particles in the final state. The requirement E — p z > 35 (42) GeV in 
the HI (ZEUS) analysis removes events where the escaping electron carries a significant momentum. It also 
suppresses events with hard initial state photon radiation. 

However, at low E' e the remaining background contribution after such a selection is of a size comparable 
to or even exceeding the genuine DIS signal. The further analysis steps differ for the HI and ZEUS analyses 
as discussed in the following. 

ZEUS Analysis Procedure The electron candidates are selected as compact electromagnetic energy depo- 
sitions in the Uranium Calorimeter (UCal). The position of the candidate is reconstructed using either the 
Small Angle Rear Tracking Detector (SRTD), which is a high-granularity lead-scintillator calorimeter, or with 
the Hadron-Electron Separator (HES), which is a silicon detector located in the electromagnetic section of the 
UCal. The candidates are selected such that E' e > 6 GeV 47 . 

The candidates are validated using information from the tracking devices. The acceptance region for 
ZEUS tracking is limited to polar angles 9 e < 154°. The tracking detectors do provide some coverage beyond 
9 e = 154°, up to 9 e « 168°, however the number of tracking layers is too sparse for full track reconstruction. 
The hit information from the tracking detectors can still be used. To do this, a "road" is created between the 
measured interaction vertex and the position of the electron candidate in the calorimeter. Hits in the tracking 
layers along the road are then counted and compared to the maximum possible number of hits. If too few hits 

47 Cut of E' e > 4 GeV is used for the event selection, although the binning for Fl measurement is chosen such that E' e > 6 GeV. 
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Fig. 54: Distribution of energy over momentum for tracks linked to clusters in the SpaCal with energy from 3.4 to 10 GeV that pass 
all the medium Q 2 analysis cuts. Tracks with a negative charge are assigned a negative E/p. 

are found, the candidate is assumed to be a neutral particle and it is rejected. To ensure the reliability of this 
method, the scattered electron is required to exit the central drift chamber at a radius R > 20 cm. Given that 
E' e > 6 GeV, this effectively limits the maximal y to y < 0.8 and the minimum Q 2 achievable at low y. In the 
HES analysis, events are measured down to y = 0.2 roughly translating to the Q 2 region, Q 2 > 24 GeV 2 . No 
background treatment based on the charge of the candidate is performed. 

The remaining jp background is estimated using Monte Carlo (MC) simulations. In order to minimise 
the model uncertainty of the jp simulation, a pure photoproduction sample is selected using an electron tagger 
placed close to the beam pipe about 6 meters away from the interaction point in the rear direction. It tags, 
with almost perfect efficiency and purity, the scattered electrons in such events which are not identified in the 
main detector and escape down the beam pipe. Photoproduction MC is verified against and normalised to this 
sample. The normalisation factor is found to be 1 ± 0.1 for all data sets. 

Figure [53] shows, as an example, comparisons of the 575 GeV data with simulated distributions, for 
the energy of the scattered electron, total E — p z , polar angle of the scattered electron, angle of the hadronic 
final state and the z coordinate of the interaction vertex. A good description of the data by the simulation is 
observed. A similar level of agreement was found for both, HER and LER data sets. 

A full set of systematic uncertainties is evaluated for the cross section measurements. The largest 
single contribution comes from the electron energy scale uncertainty, which is known to within ±1% for 
E' e > 10 GeV, increasing to ±3% at E' e =5 GeV. Other significant contributions are due to the ± 10% un- 
certainty in verifying the Pythia prediction of the jp cross section using the electron tagger. The systematic 
uncertainty due to the luminosity measurement was reduced by scaling the three cross sections relative to each 
other. The spread of relative normalisation factor was found to be within the expected level of uncorrected 
systematic uncertainty. 

HI Analysis Procedure The HI measurements of Fl are performed in separate analyses involving different 
detector components and thus covering different Q 2 ranges. In the high-Q 2 analysis the electron candidate is 
selected as an isolated electromagnetic energy deposition in the Liquid Argon (LAr) calorimeter which covers 
the polar angle range 4° < 9 < 153°. The selected cluster is further validated by a matching track recon- 
structed in the central tracking device (CT) with an angular acceptance of 15° < < 165°. In the medium 
Q 2 analysis the electron candidate is selected in the backward calorimeter SpaCal covering the angular range 
153° < 9 < 177.5° and is also validated by a CT track. Lower Q 2 values are expected to be accessed in the 
third analysis, in which the SpaCal cluster is validated by a track in the Backward Silicon Tracker reaching 
the highest 9. The first measurement of Fl at medium Q 2 is already published [181], and preliminary results 
of the combined medium-high- Q 2 analysis are available. 

The remaining 'yp background is subtracted on statistical basis. The method of background subtraction 





Fig. 55: Top: comparison of the correct sign data (points) with the sum (open histogram) of the DIS MC simulation and background, 
determined from the wrong sign data (shadowed histogram), for the energy E' e (left) and the polar angle 6 e (right) of the scattered 
electron, for the 460 GeV data with E' e < lOGeV. Bottom: as top but after background subtraction. 



relies on the determination of the electric charge of the electron candidate from the curvature of the associated 
track. 

Figure|54]shows the E/p distribution of the scattered electron candidates from e + p interactions with the 
energy E measured in the SpaCal and the momentum p of the linked track determined by the CT. The good 
momentum resolution leads to a clear distinction between the negative and positive charge distributions. The 
smaller peak corresponds to tracks with negative charge and thus represents almost pure background. These 
tracks are termed wrong sign tracks and events with such candidates are rejected. The higher peak, due to right 
sign tracks, contains the genuine DIS signal superimposed on the remaining positive background. The size 
of the latter to first approximation equals the wrong sign background. The principal method of background 
subtraction, and thus of measuring the DIS cross section up to y ~ 0.9, consists of the subtraction of the wrong 
sign from the right sign event distribution in each x, Q 2 interval. 

The background subtraction based on the charge measurement requires a correction for a small but 
non-negligible charge asymmetry in the negative and positive background samples, as has been observed pre- 
viously by HI [89]. The main cause for this asymmetry lies in the enhanced energy deposited by anti-protons 
compared to protons at low energies. The most precise measurement of the background charge asymmetry 
has been obtained from comparisons of samples of negative tracks in e + p scattering with samples of positive 
tracks in e~p scattering. An asymmetry ratio of negative to positive tracks of 1.06 is measured using the high 
statistics e^p data collected by HI in 2003-2006. This result is verified using photoproduction events with a 
scattered electron tagged in a subdetector of the luminosity system. 

Figure [55] shows, as an example, comparisons of the 460 GeV high y data with simulated distributions, 
for the energy and the polar angle of the scattered electron prior to and after subtraction of the background, 
which is determined using wrong sign data events. 

The measurement of Fl as described below relies on an accurate determination of the variation of the 
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cross section for a given x and Q 2 at different beam energies. In order to reduce the uncertainty related to the 
luminosity measurement, which presently is known to 5% for each proton beam energy of the 2007 data, the 
three data samples are normalised relatively to each other. The renormalisation factors are determined at low 
y, where the cross section is determined by F2 only, apart from a small correction due to Fl. The relative 
normalisation is known to within 1.6%. 

All correlated and uncorrected systematic errors combined with the statistical error lead to an uncer- 
tainty on the measured cross sections at high y of 3 to 5%, excluding the common luminosity error. 

4. 2. 4 Measurements of Fl(x,Q 2 ) by HI and ZEUS 

The longitudinal structure function is extracted from the measurements of the reduced cross section as the 
slope of a r versus y 2 /Y + , as can be seen in eq.[53] This procedure is illustrated in Fig. [56] The central 
Fl values are determined in straight-line fits to a r (x, Q 2 ,y) as a function of y 2 /Y + using the statistical and 
uncorrected systematic errors. 

The first published HI measurement of Fl{x, Q 2 ) is shown in Fig. [57] the preliminary ZEUS measure- 
ment is presented in Fig. [58] The HI measured values of Fl are compared with the HI PDF 2000 fit [18], 
while the ZEUS Fl values are compared to the ZEUS-JETS PDF fit [170]. Both measurements are consistent 
and show a non-zero Fl. 

The HI results were further averaged over x at fixed Q 2 , as shown in the left panel of Fig. [59] The 
averaging is performed taking the x dependent correlations between the systematic errors into account. The 
averaged values of Fl are compared with HI PDF 2000 fit and with the expectations from global parton 
distribution fits at higher order perturbation theory performed by the MSTW [182] and the CTEQ [131, 169] 
groups. Within the experimental uncertainties the data are consistent with these predictions. The measurement 
is also consistent with previous indirect determinations of Fl by HI. 

In the combined medium-high Q 2 analysis by HI the Q 2 range is extended up to Q 2 = 800 GeV 2 . The 
preliminary results are shown in the right panel of Fig.[59] In some Q 2 bins there is an overlap between the 
SpaCal and LAr measurements which improves the precision of the Fl extraction as compared to the pure 
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SpaCal analysis. 
4.2.5 Summary 

Direct measurements of the proton structure function Fl have been performed in deep inelastic ep scattering 
at low x at HERA. The Fl values are extracted by the HI and ZEUS collaborations from the cross sections 
measured at fixed x and Q 2 but different y values. This is achieved by using data sets collected with three 
different proton beam energies. The HI and ZEUS results are consistent with each other and exhibit a non- 
zero Fl. The measurements are also consistent with the previous indirect determinations of Fl by HI. The 
results confirm DGLAP NLO and NNLO QCD predictions for Fl(x,Q 2 ), derived from previous HERA data, 
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Fig. 59: The proton structure function Fl shown as a function of Q 2 at the given values of x: a) first direct measurement at HERA by 
HI; b) preliminary HI results combining SpaCal and LAr analyses. The inner error bars denote the statistical error, the full error bars 
include the systematic errors. The luminosity uncertainty is not included in the error bars. The solid curve describes the expectation 
on Fl from the HI PDF 2000 fit using NLO QCD. The dashed (dashed-dotted) curve depicts the expectation of the MSTW (CTEQ) 
group using NNLO (NLO) QCD. The theory curves connect predictions at the given (x, Q 2 ) values by linear extrapolation. 



which are dominated by a large gluon density at low x. 



5 PROTON-PROTON LUMINOSITY, STANDARD CANDLES AND PDFS AT THE LHC 48 
5.1 Introduction 

The Large Hadron Collider (LHC) is expected to start colliding proton beams in 2009, and is expected to reach 
design parameters in energy and luminosity sometime later and deliver a few fb^ 1 per year of data at the 14 
TeV collision energy. 

During the past 15 years many theoretical calculations and experimental simulations have demonstrated 
a huge potential to perform many accurate tests of the Standard Model (SM) with LHC data, which could yield 
insight into new physics mechanisms. 

To make these tests, the experiments identify a particular signature X and observe, using a variety of 
selection criteria, a certain number of events in a given data taking period. After correcting this event rate 
for backgrounds and the selection efficiency, the number is converted into a cross section. The cross section, 
&pp-^x can be compared with theoretical predictions 49 according to the formula: N corrected, = a pp^x x L pp 
where L pp is the recorded proton proton luminosity. 

Besides the statistical errors of a measurement, the systematic error is related to the uncertainties from 
the L pp determination, the background and efficiency corrections within the detector acceptance and from ex- 
trapolations into the uncovered very forward rapidity regions. The interpretation of an observed cross section 
within the SM requires further the knowledge of the theoretical cross section. Thus the uncertainties of the 
proton parton distribution function (PDF) have to be considered also. 

In this Section we describe the status and perspectives of the ATLAS, CMS and LHCb, the three LHC pp 
collision detectors [183], to determine the proton proton luminosity normalization. The investigated methods 
are known and studied since many years and can be separated into the absolute (1) direct and (2) indirect 
proton proton luminosity determination. A third approach (3) tries to measure and calculate final states only 
relative to well understood reactions which depend on the parton-parton luminosity and are as such largely 
independent of the knowledge of the pp luminosity. 

• Absolute, direct or indirect, proton proton luminosity normalization: If the absolute approach is used, 
the interpretations of a measured reaction cross section depends still on the knowledge of parton distri- 
bution function (PDF), which must be obtained from other experiments. Examples are: 

- The proton proton luminosity normalization is based on the measurements of the beam currents 

and shapes. While the beam currents can be accurately determined using beam transformers, 
the beam profiles are more difficult to determine directly and usually constitute the dominant 
source of uncertainty on a luminosity measurement using this technique. The use of the machine 
luminosity determination using beam parameter measurements [184] and [185] will be described 
in Section l5.3.1l Alternatively one can try to measure the beam profiles also within the experiments 
using the precision vertex detectors. A short description of this idea, currently pursued within the 
LHCb collaboration, is also given in Section l5.3.1l 

- The simultaneous measurements of a pair of cross sections that are connected with each other 
quadratically via the optical theorem. A well known example of this is the measurement of the 
total inelastic cross section and the elastic cross section at very high pseudorapidities \rj\ « 9 and 
will be described in Section [533] 

So called instantaneous or real time luminosity measurements are based on "stable" high rate 
measurements of particular final state reactions. Once the ratio of such reactions to the pp lumi- 
nosity determination has been measured, those reactions can be subsequently used as independent 
luminosity monitors. Some possibilities are discussed in Section [5331 

- The indirect absolute proton proton luminosity normalization is based on the theoretically well 
understood "two photon" reaction pp — » ppfifi [186, 187] (Section [5.3.51) . This reaction could 
perhaps be considered as the equivalent of the luminosity counting in e + e~ experiments using 
forward Bhabha scattering. 

48 Contributing authors: J. Anderson, M. Boonekamp, H. Burkhardt, M. Dittmar, V. Halyo, T. Petersen 

49 Alternatively, one can also apply a Monte Carlo simulation to the theoretical prediction and compare the number of background 
corrected events directly. 



• Indirect pp luminosity measurements use final states, so called "standard candles", with well known 
theoretical cross sections (Section \5A\) . 

Obviously, the resulting proton proton luminosity can only be as good as the theoretical and experimen- 
tal knowledge of the "standard candle" reaction. The theoretically and experimentally best understood 
LHC reactions are the inclusive production of W and Z bosons with subsequent leptonic decays. Their 
large cross section combined with experimentally well defined final states, e.g. almost background free 
Z and W event samples can be selected over a relative large rapidity range, makes them the preferred 
LHC "standard candle" reaction. Other interesting candidates are the high p t jet - boson (= 7, W or Z) 
final states. The indirect luminosity method requires also some knowledge of the PDFs, and of course, 
if one follows this approach, the cross section of the "standard candle" reaction becomes an input and 
can not be measured anymore. Thus, only well understood reactions should be considered as candidate 
reactions. 

• pp luminosity independent relative rate measurements using "standard candle" reactions. 

In addition to the above indirect pp luminosity determinations, "standard candle" reactions allow to 
perform luminosity independent relative event rate calculations and measurements. This approach has 
already been used successfully in the past and more details were discussed during the past HERA-LHC 
workshop meetings [1], For some reactions, this approach appears to be much easier and more accurate 
than standard cross section measurements and their interpretations. Perhaps the best known example at 
hadron colliders is the measurement and its interpretation of the production ratio for Z and W events, 
where Tevatron experiments have reached accuracies of about 1-2% [188, 189]. Another example is 
related to relative branching ratio and lifetime measurements as used for b-flavored hadrons. 

Furthermore the rapidity distributions of leptonic W and Z decays at the LHC are very sensitive to the 
PDF parameterization and, as was pointed out 10 years ago [190], one can use these reactions to determine the 
partem luminosity directly and very accurately over a large x (= parton momentum/proton momentum) range. 
In fact, W and Z production with low transverse momentum were found in this analysis to be very sensitive 
to qq luminosities, and the jet-boson final states, e.g. the jet-7, Z, W final states at high transverse momentum 
are sensitive to the gluon luminosity. 

In the following we attempt to describe the preparations and the status of the different luminosity mea- 
surements and their expected accuracies within ATLAS, CMS and LHCb. Obviously, all these direct and 
indirect methods should and will be pursued. In Section [531 we compare the advantages and disadvantages of 
the different methods. Even though some methods look more interesting and rewarding than others, it should 
be clear from the beginning that as many independent pp luminosity determinations as possible need to be 
performed by the experiments. 

We also try to quantify the systematic accuracies which might be achieved over the next few years. As 
these errors depend somewhat on the overall achieved luminosity, we need in addition a hypothetical working 
scenario for the first 4 LHC years. We thus assume that during the first year, hopefully 2009, data at different 
center of mass energies can be collected by ATLAS and CMS. During the following three physics years we 
expect that 10 TeV will be the highest collision energy in year I and that at most 100 pb _1 can be collected. 
We assume further that during the following two years the design energy of 14 TeV can be achieved and that 
a luminosity of about 1 fb _1 and 10 fb _1 can be collected respectively per year. During the first few years 
similar numbers are expected for the LHCb experiment. However once the LHC reaches the first and second 
phase design luminosity of 10 33 /cm 2 /sec and 10 34 /cm 2 /sec it is expected that the LHCb experiment will run 
at an average luminosity of 2 x 10 32 /cm 2 /sec (resulting in about 2 fb" 1 /per year). 

5.2 Luminosity relevant design of ATLAS/CMS and LHCb 

In the following we give a short description of the expected performance with respect to lepton and jet identi- 
fication capabilities. Especially the electron and muon measurement capabilities are important for the identi- 
fication of events with leptonic decays of W and Z bosons. 

Both ATLAS and CMS are large so called omni purpose experiments with a large acceptance and 
precision measurement capabilities for high p t electrons, muons and photons. Currently, the simulations 



of both experiments show very similar performance for a large variety of LHC physics reactions with and 
without jets. For the purpose of this Section we focus on the possibility to identify the production of inclusive 
W and Z decays with subsequent decays to electrons and muons. Both experiments expect excellent trigger 
accuracies for isolated leptons and it is expected that electrons and muons with momenta above 20-25 GeV 
can be triggered with high efficiency and up to \rj\ of about 2.5. The special design of the ATLAS forward 
muon spectrometer should allow to detect muons with good accuracy even up to \rj\ of 2.7. 

The operation of ALFA, a very far forward detector placed about 240 m down the beam line, is en- 
visaged by the ATLAS collaboration to provide an absolute luminosity measurement, either using special 
optics LHC running and the use of the optical theorem or using the total cross section measurement from the 
dedicated TOTEM experiment installed near CMS; results from this device can be expected from 2010 and 
on-wards. In addition to absolute luminosity measurements from ALFA the two detectors LUCID and the 
Zero-Degree-Calorimeter (CDC) [191] are sensitive to the relative luminosity at time scales of single bunch 
crossings. 

A similar approach for absolute and relative luminosity measurements is foreseen by the CMS experi- 
ment. Here it is planned that dedicated forward detectors, the Hadron Forward Calorimeter (HF) and the ZDC 
device provide similar results as the ones in ATLAS. 

Another technique that is expected to be available early on is a luminosity-independent measurement of 
the pp total cross section. This will be done using a forward detector built by the TOTEM experiment [192]. 

The LHCb experiment [193] has been designed to search for New Physics at the LHC through precision 
measurements of CP violating observables and the study of rare decays in the b-quark sector. Since the bb pairs 
resulting from the proton-proton collisions at the LHC will both be produced at small polar angles and in the 
same forward or backward cone, LHCb has been designed as a single-arm forward spectrometer covering the 
pseudo rapidity range 1.9 < rj < 4.9. The LHCb tracking system, which is composed of a silicon vertex 
detector, a warm dipole magnet and four planar tracking stations, will provide a momentum resolution of 
8P/P = (0.3 + 0.001AP/ GeV)Vo [194]. Muon identification is primarily achieved using a set of five planar 
multi-wire proportional chambers, one placed in front of the calorimeter system and four behind, and it is 
expected that for the momenta range 3-150GeV/c an identification efficiency of ~*98% and an associated pion 
dis-identification rate of ~1% will be achieved. The reconstruction of primary and secondary vertices, a task 
of crucial importance at b physics experiments, will be virtually impossible in the high particle multiplicity 
environment present with the nominal LHC running luminosity of 10 34 cm _2 s _1 - LHCb has therefore been 
designed to run at the lower luminosity of 2 x 10 32 cm" 2 s _1 . 

Recent LHCb simulations have shown that leptonic W and Z decays to muons can be identified with 
a small background in the forward and very forward rapidity region starting from rj of 1.9 and up to values 
larger than 4. As will be discussed later in more detail, the common muon acceptance region for the three LHC 
experiments between 1.9 and about 2.5 will allow to cross check and normalize the W and Z measurements 
in this region. Consequently the unique large rapidity from 2.5 to 4.9 can be used by LHCb to investigate the 
very low x range of the PDFs for the first time. 

The absolute luminosity at LHCb will be obtained either directly, by making measurements of the beam 
parameters, or indirectly via a measurement of the event rate of an accurately predicted physics process. 

As will be explained in the following Sections, all experiments will try to perform as many as possible 
direct and indirect absolute and relative luminosity measurements and will, if available, at least during the first 
years, also use luminosity numbers from the machine group. 

5.2.1 Lepton triggering and W/Z identification. 

Generally, the lepton trigger selections depend on the instantaneous luminosity and some pre-scaling might 
eventually needed. However, current simulations by all experiments show that the envisaged \r]\ a.ndp t thresh- 
olds will not limit the measurement accuracies of leptons originating from W and Z decays. 

The lepton trigger selections that generally perceived to be used for most W and Z related analysis are 
very similar in ATLAS and CMS as indicated in Table [9] 

Trigger and reconstruction efficiencies for leptonic W and Z decays within the acceptance of the detec- 
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Table 9: For ATLAS and CMS the lepton trigger/selection pt thresholds are given for single isolated leptons. *For the LHCb threshold 
is given for the muon pair mass instead of single muons and only positive values of r\ are covered. 



tors have been estimated for ATLAS to be 97.7% and 80.0% for electrons and 84.3% and 95.1% for muons, 
respectively. The reconstruction efficiency includes the trigger efficiencies and the off-line electron and muon 
selections used later to identify clean inclusive W and Z event samples [195]. 

The current equivalent trigger and off-line efficiencies for CMS are about 85% and 77% for electrons 
and combined about 85% for single muons [196]. Similar efficiency numbers for muons from W and Z decays 
are expected within the LHCb acceptance region [197]. Current simulations show that these numbers can be 
determined with high accuracies, reaching perhaps 1% or better, at least for isolated leptons 50 which have a 
transverse momentum some GeV above the trigger thresholds. For lower momenta near the thresholds or for 
additional special trigger conditions somewhat larger systematic uncertainties can be expected. 



5.3 Direct and indirect absolute pp luminosity measurements 

Three different absolute proton proton luminosity measurements are discussed in this Section. (1) The machine 
luminosity determination using beam parameter measurements [198], (2) the luminosity independent total pp 
cross section measurement combined with the measurement of the elastic pp scattering rate [192] and (3) 
the measurement of the "two photon" reaction pp — > ppfifi [186, 187]. As will be discussed in more detail 
in Section 1531 only method (3) can be performed during the normal collision data taking. For method (1) 
some special methods, which take the actual detector performance during each run into account, need to be 
developed. Method 2 uses a two phase approach (a) a special machine optics run with low luminosity to 
determine the total cross section and (b) a normalization to some high rate final state reactions which can be 
counted during normal physics runs. 



5.3.1 Proton-proton luminosity from machine parameters 

The luminosity for colliding beams can be directly obtained from geometry and numbers of particles flowing 
per time unit [184]. This can be used to determine the absolute LHC luminosity from machine parameters 
without prior knowledge of pp scattering cross sections. The principle is briefly outlined here. More details 
can be found in [185]. 
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Fig. 60: Luminosity from particles flux and geometry. 

For two bunches of N\ and N2 particles colliding head-on in an interaction region as sketched in Fig 
with the frequency / the luminosity is given as 

NiNo f 



50 As isolated high pt photons are triggered essentially like electrons similar accuracies for both particle types can be assumed. 

51 Contributing author: H. Burkhardt 



A e s is the effective transverse area in which the collisions take place. For a uniform transverse particle 
distribution, A c g would be directly equal to the transverse beam cross section. More generally, the effective 
area can be calculated from the overlap integral of the two transverse beam distributions g\{x,y), g2(x,y) 
according to 

9i(x,y)g 2 {x,y) dxdy . (59) 
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The collision frequency / is accurately known. The number of particles circulating in a storage ring is mea- 
sured using beam current transformers to roughly 1% precision [198]. 
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Fig. 61: Schematic view of the steps involved in an orthogonal separation scan proposed for the LHC (left) and a possible result in 
one direction (based on early LEP data) shown on the right. 



The main uncertainty in the absolute luminosity determination from machine parameters is expected to 
originate in the knowledge of the transverse beam dimensions. Safe operation of the LHC requires a rather 
good knowledge of the optics and beam sizes and we expect that this should already allow a determination of 
the luminosity from machine parameters to about 20 — 30 percent. A much better accuracy can be obtained 
when the size of the overlap region at the interaction points is determined by measuring the relative luminosity 
as a function of lateral beam separation, as illustrated in Fig.[6T| This technique was pioneered at the ISR [199] 
and allowed to reduce the uncertainty to below 1%, [200,201]. 

For the more complicated LHC and early operation, a 10% overall uncertainty in the absolute LHC 
machine luminosity calibration should be a realistic goal. The actual precision will depend on the running 
time and effort which is invested. A relatively small number of scans under favorable beam conditions will 
in principle be sufficient to obtain and verify the reproducibility in the absolute luminosity calibration. While 
fast scans may always be useful to optimize collisions, we assume that any dedicated, detailed luminosity 
scans will become obsolete when the other, cross section based luminosity determinations described in these 
proceedings allow for smaller uncertainties. 



Optimal running conditions are moderate bunch intensities, large bunch spacings, no crossing angle and 
p* = 2 m or larger. These conditions are in fact what is proposed anyway for the initial LHC operation with 
43 - 156 bunches per beam. Statistics are not expected to be a problem. For early operation at top energy (10 
- 14 TeV) with 43 bunches and 4 x 10 10 particles per bunch, before beams are squeezed, at a j3* = 11 m, we 
already expect luminosities of the order of 10 30 cm _2 s _1 resulting in event rates of 10 4 Hz, for a cross section 
of 0.01 barn as typical for the low angle luminosity monitors. 

From the LHC injectors, we expect bunch by bunch variations of about 10% in intensity and 20% in 
emittance. For the large spacing between bunches in the operation with up to 156 bunches, there is no need 
for crossing angles at the interaction points. Parasitic beam-beam effects will be negligible. All bunches in 
each beam will follow the same equilibrium orbit and collide at the same central position. 

Calibration runs require good running conditions and in particular good beam lifetimes. Bunch by bunch 
differences are not expected to change significantly during a scan. Storing bunch intensities at the beginning 
and end of a scan and using one set of timed averaged bunch intensities for a scan should be sufficient. To 
avoid any bias, it will be important to use the correct pairing of bunch intensities and relative luminosities 
in the calculation of absolute bunch luminosities according to Eq.[5Hl before any summing or averaging over 
different bunches. 

We are currently preparing an on-line application for automatic luminosity scans 52 . Scan parameters 
like range, step size and duration can be set before the start of the scan. Once the parameters are defined, it 
is possible to launch automatic horizontal and vertical separation scans in the LHC interactions regions. For 
a detailed scan, we may choose a range from -4 to +4 a in nominal beam size in steps of 0.5 a, resulting in 
17 equidistant points. If we wait 1 s between points to allow for the magnets to change and for 2 s integration 
time, the scan time would still be below a minute per plane. Details are currently being worked out in close 
collaboration with the experiments. Exchanging all data bunch-by-bunch at a 1 Hz rate between the machine 
control room (CCC) and the experiments would be rather demanding and risks to saturate current capacities. 

For the initial running, it will be sufficient to exchange average values at about 1 Hz rate. It allows 
quality monitoring and the determination of the peak position. For the detailed off line analysis, we only have 
to rely on local logging and timing information synchronized to at least 1 s precision at the beginning of the 
scan. With fixed time interval defined and saved before the scan, this allows for off-line synchronization of 
the detailed data and a complete bunch by bunch analysis. 



5.3.2 Direct measurements of the absolute luminosity at LHCb 

LHCb plans to measure the absolute luminosity using both the Van Der Meer scan, [199], and beam-gas 
techniques following a more recently proposed method [202]. Here one tries to determine the transverse 
beam profiles at colliding beam experiments utilizing the precision vertex detectors found at modern HEP 
experiments to reconstruct beam gas interactions near the beams crossing point. The vertex resolution in the 
transverse direction at LHCb can be parameterized by the relation 

lOO^m 

Ox,y = , (62) 
V M tracks 

where N tra cks is the number of tracks originating from the vertex. Since the nominal transverse bunch size at 
LHCb will be 100/xm, the reconstruction of beam-gas vertices's, which will have a track multiplicity of ~ 10, 
will enable the measurement of the colliding bunch profiles and the beam overlap integral. This method is 
currently under investigation by the LHCb collaboration and is expected to result in a luminosity measurement 
with an associated uncertainty of 3-5%. 

5.3.3 Absolute pp luminosity from specialized detectors and from the total cross section measurement 

ATLAS and CMS are planning to perform absolute and relative pp luminosity measurements using dedicated 
luminosity instruments. 



52 Done by Simon White, as part of his PhD thesis work on the LHC machine luminosity determination 



Three particular luminosity instruments will operate around the ATLAS interaction point. 
The absolute luminosity measurement will be provided by ALFA [191] placed 240m down the beam line and 
due to operate in 2010. This measurement requires some special optics low luminosity running of the LHC and 
should be able to measure the very low angle Coulomb scattering reaction. The expected precision is of the 
order 3%, depending on yet unknown LHC parameters during running. The ALFA detector can also measure 
the absolute luminosity using the optical theorem if the Coulomb region can not be reached. Extrapolating the 
elastic cross section to very low momentum transfer t = and using the total cross section as measured by 
TOTEM [192] (located at the CMS interaction point) current simulations indicate that a precision of about 3% 
might also be reached with this method. In addition to absolute luminosity measurements from ALFA, LUCID 
and a Zero-Degree-Calorimeter (ZDC) [191] are sensitive to the relative single bunch crossings luminosity. 
LUCID and ZDC will however not give absolute measurements. 

A similar approach is currently foreseen by the CMS collaboration [203]. 

5.3.4 Real time relative luminosity measurements 

A large number of instantaneous relative luminosity measurements have been discussed during the past years 
by ATLAS, CMS and LHCb and more details can be found in the three presentations given during the "stan- 
dard candle" session of this workshop [204]. As an example we outline in the following some ideas discussed 
within CMS. 

Multiple techniques capable of providing suitable luminosity information in real time have been identi- 
fied in CMS. One technique employs signals from the forward hadron calorimeter (HF) while another, called 
the Pixel Luminosity Telescope (PLT), uses a set of purpose-built particle tracking telescopes based on single- 
crystal diamond pixel detectors. At this writing, the PLT has not been formally approved, but is under study. 
The methods based on signals from the HF described are the ones being most vigorously pursued. 

Two methods for extracting a real-time relative instantaneous luminosity with the HF have been studied. 
The first method is based on "zero counting," in which the average fraction of empty towers is used to infer 
the mean number of interactions per bunch crossing. The second method called "EtSum method" exploits the 
linear relationship between the average transverse energy per tower and the luminosity. 

Outputs of the QIE chips used to digitize the signals from the HF PMTs on a bunch-by-bunch basis 
are routed to a set of 36 HCAL Trigger and Readout (HTR) boards, each of which services 24 HF physical 
channels. In order to derive a luminosity signal from the HTR, an additional mezzanine board called the HF 
luminosity transmitter (HLX) is mounted on each of the HTR boards. The HLX collects channel occupancy 
and Et sum data to create eight histograms: two sets of three occupancy histograms, one E^-sum histogram, 
and one additional occupancy histogram. These histograms comprise about 70 KB of data, which is transmit- 
ted at a rate of approximately 1.6 Mbps to a dedicated luminosity server via an Ethernet switch that aggregates 
the data from multiple HLX boards for further processing. 

Although all HF channels can be read by the HLX, MC studies indicate that the best linearity is 
obtained using only the inner four rj rings. The algorithm has been optimized to minimize sensitivity to 
pedestal drifts, gain changes and other related effects. Both "Zero Counting" and the "EtSum" method have 
demonstrated linearity up to LHC design luminosity. A statistical error of about 1% will be achieved at 
fewtimes x 10 cm s Hence the dominant error on the absolute luminosity will result from the normal- 
ization of the online relative luminosity. 

5.3.5 Proton-proton luminosity from the reaction pp — ► pp/x/i 

The QED process pp — ► pp[i + fi~ , where a /i + /U~ pair is produced via photon-photon scattering, was first pro- 
posed for luminosity measurements at hadron colliders in [186]. At the LHC such pairs will be predominantly 
produced with small transverse momenta, at small polar angles and in the same forward or backward cone. 

All three experiments are considering to use the well calculated pp — > ppfifx process for measuring ab- 
solute luminosity. The theoretical understanding of this QED photon-photon scattering reactions is considered 
to be accurate to better than 1%. Consequently this final state is thus often considered to be the perfect the- 
oretical luminosity process. However, the experimental identification of this process requires to select muon 



pairs with low mass and within a well understood acceptance. The measurement of this reaction at a hadron 
collider appears to be much more difficult than the corresponding measurements of the reaction ee — > ee/x/x 
at LEP. The systematic measurement error for example in L3 and after several years of data taking was about 
±3% [205] 

Current simulations by the three LHC experiments indicate that the final state can be identified using 
straight forward criteria. For ATLAS and CMS one finds that about 1000 accepted events could at best be 
expected for an integrated luminosity of 1 fb _1 , resulting in a statistical error of about ± 3%. 

For example the ATLAS study selects oppositely charged back-to-back muon tracks with p? > 6 GeV 
and 1 77 1 < 2.2 with an invariant mass less than 60 GeV and a common vertex with no other tracks originating 
from it (isolation), yields a cross section of 1.33 pb. Thus, about 1300 events can be expected for running 
periods with a luminosity of 1 fb _1 and yielding a potential statistical error of 3%. However, backgrounds 
not only from pile up events will be a critical issue. Some proton tagging with high luminosity roman pots is 
currently investigated but this will certainly reduce the accepted cross section and introduce additional accep- 
tance errors. Similar conclusions have been reached by simulations performed within the CMS collaboration. 
Consequently, both experiments expect that, during the coming years, this reaction will be mainly used as a 
cross check of the other methods. 

The cross section for this process where both muons lie inside the LHCb acceptance and have a com- 
bined invariant mass greater than 2.5GeV is « 88 pb. The expected uncertainty is perhaps 1% or smaller and 
comes mainly from rescattering corrections [187], i.e. strong interactions between the interacting protons. 

The feasibility of using the elastic two photon process pp — > p + + p to make luminosity mea- 

surements at LHCb was first explored in [206] and has recently been investigated in more detail by members 
of the LHCb collaboration [207]. A variety of background processes have been studied: dimuons produced via 
inelastic two-photon fusion and double pomeron exchange; low mass Drell-Yan pairs; QCD processes such 
as bb — ► n + fJ>~ + X; and the combinatoric backgrounds caused by KJtt mis-identification. A simple offline 
selection has been developed that requires: the dimuon pair transverse momentum to be less than 50MeV/c; 
the dimuon invariant mass to be in the range 2.5GeV/c 2 < < 20GeV/c 2 ; and a charged particle mul- 
tiplicity of less than 3 (i.e. the event should contain a pair and no other charged particles). These 
criteria select ~ 27% of the signal events that pass the trigger and are reconstructed and result in a background 
contamination that is (4.1 ± 0.5(stat.) ± l.0(syst.))% of the signal level with the dominant contribution 
due K/7r mis-identification. Overall it is expected that ~ 10 4 pp — > p + + p events will be triggered, 
reconstructed and selected at LHCb during one nominal year of data taking (2/6 -1 ). Systematic uncertainties 
on a luminosity measurement at LHCb using this channel are estimated to be ~ 1.31% and are dominated by 
the uncertainty on the predicted cross section for events containing dimuons produced via double pomeron 
exchange, an uncertainty that is expected to be reduced in the near future. A measurement of the absolute 
luminosity at LHCb using this channel and a dataset of 2fb~ l will therefore be possible with an associated 
uncertainty of ~ 1.5%. 

In summary, the accurate measurement of this theoretically well understood reaction looks like an 
interesting challenge for the LHC experiments. Interesting results can be expected once integrated luminosities 
of 5 fb _1 and more can be accumulated for ATLAS and CMS and about 1 fb _1 for LHCb. Of course, it 
remains to be proven, if the systematic uncertainties under real data taking conditions can indeed be reduced 
to the interesting 1 % level. 

5.4 Indirect and relative pp luminosity measurements 

The methods to measure the absolute proton proton luminosity and their limitations have been described in 
the previous chapter. 

In this Section we will describe the possibilities to measure the luminosity indirectly using well defined 
processes, so called "Standard Candles" and their use to further constrain the PDFs and discuss the possibility 
to "measure" directly the parton-parton luminosities. 

Before describing the details of these indirect approaches, a qualitative comparison of luminosity mea- 
surements at e + e~ colliders and hadron colliders might be useful. The most important difference appears 



to be that in the e + e~ case one studies point like parton parton interactions. In contrast, at hadron hadron 
interactions one studies the collision of protons and other hadrons made of quarks and gluons. As a result, 
in one case the Bhabha elastic scattering reaction e + e~ — > e + e~ at low Q 2 reaction can be calculated to 
high accuracy and the observed rate can be used as a luminosity normalization tool. In contrast, the elastic 
proton proton scattering cross section can not be calculated at the LHC nor at any other hadron colliders. As a 
consequence, absolute normalization procedures depend always on the measurement accuracy of the pp total 
cross section. Even though it is in principle possible to determine the pp total cross section in a luminosity 
independent way using special forward detectors like planned by the TOTEM or the ALFA experiments, the 
accuracy will be limited ultimately and after a few years of LHC operation to perhaps a few %. 

Furthermore, as essentially all interesting high Q 2 LHC reactions are parton parton collisions, the ma- 
jority of experimental results and their interpretation require the knowledge of parton distribution functions 
and thus the parton luminosities. 

Following this reasoning, more than 10 years ago, the inclusive production of W and Z bosons with 
subsequent leptonic decays has been proposed as the ultimate precision parton parton luminosity monitor at 
the LHC [190]. The following points summarize the arguments why W and Z production are indeed the ideal 
"Standard Candles" at the LHC. 

• The electroweak couplings of W and Z bosons to quarks and leptons are known from the LEP mea- 
surements to accuracies smaller than 1% and the large cross section of leptonic decays W and Z bosons 
allows that these final states can be identified over a large rapidity range with large essentially back- 
ground free samples. 

• Systematic, efficiency corrected counting accuracies within the detector acceptance of 1% or better 
might be envisioned during the early LHC running. In fact it is believed that the relative production rate 
of W and Z can be measured within the detector acceptance with accuracies well below 1%. 

• Theoretical calculations for the W and Z resonance production are the most advanced and accurately 
known LHC processes. Other potentially more interesting LHC reactions, like various diboson pair 
production final states are expected to have always larger, either statistical or systematic, experimental 
and theoretical uncertainties than the W and Z production. 

• The current PDF accuracies, using the latest results from HERA and other experiments demonstrate 
that the knowledge of the quark and anti quark accuracies are already allowing to predict the W and 
Z cross at 14 TeV center of mass energies to perhaps 5% or better. The measurable rapidity and p t 
distributions of the Z boson and the corresponding ones for the charged leptons from W decays can be 
used to improve the corresponding parton luminosity functions. 

Obviously, the use of W and Z bosons as a luminosity tool requires that the absolute cross section 
becomes an input, thus it can not be measured anymore. As a result this method has been criticized as being 
"a quick hack at best". In contrast, advocates of this method point out that this would not be a noticeable loss 
for the LHC physics program. 

5.4.1 Using the reaction pp — > Z — > £ + £~ to measure L pp 

Very similar and straight forward selection criteria for the identification of leptonic Z decays, depending 
somewhat on the detector details and the acceptance region, are applied by ATLAS, CMS and LHCb. In the 
following the current selection strategy in ATLAS and LHCb are described. 

5.4.2 Measuring Zand W production, experimental approaches in ATLAS 

The ATLAS W and Z cross section measurements are based on the following selections in the electron and 
muon channels: 

• A typical selection of W — ► ev requires that events with "good" electrons have to fulfill the additional 
kinematic acceptance criteria: 

p T > 25 GeV, |t?| < 1.37 or 1.52 < \-q\ < 2.4. 



The criteria for W — > [iv muons are similar where px > 25 GeV and \rj\ < 2.5. is required. Further- 
more, in order to classify the event as a W event, the reconstructed missing transverse momentum and 
the transverse mass should fulfill ET{miss) > 25GeV and my (IV) > 40 GeV. 
• The selection of Z — > ee and Z — > requires that a pair of oppositely charged electrons or muons 
is found. Due to lower background the electrons should have p? > 15 GeV and \rj\ < 2.4 and their 
invariant mass should be between 80-100 GeV. 

Similar criteria are applied for the muons with pr > 15 GeV and \r)\ < 2.5. The reconstructed mass 
should be between 71-111 GeV. 

Following this selection and some standard Monte Carlo simulations, the expected number of recon- 
structed events per 10 pb^ 1 at = 14 TeV are about 45000, 5500 for W and Z decays to electrons and 
60000, and 5000 for the decays to muons, respectively. Thus, even with a small data sample of only 10 pb" 1 , 
the statistical uncertainty for the Z counting comes close to 1 % in each channel. 

Systematic uncertainties from the experimental selection are dominated by the Z efficiency determina- 
tion and from backgrounds in the W selection. Other sources of uncertainties originate from the knowledge 
of energy scale and the resolution. The lepton efficiencies are evaluated by considering Z — > 11 events and 
using the so called "tag and probe" method, like for example described by the DO experiment [188, 189]. The 
efficiency uncertainty associated with the precision of this method has been estimated for a data sample of 50 
pb -1 (1 ftT 1 ) of data to be 2% (0.4%) for W and 3% (0.7%) for Z events. The backgrounds for W events are 
of the order 4% in the electron channel and 7% in the muon channel. The main contributions are from other 
W or Z decays, and are thus well understood, leading to background uncertainties of the order 4% for both 
channels if a sample 50 pb" 1 is analyzed. For much larger samples it is expected that uncertainties at or below 
1 % can be achieved. The backgrounds for the Z decays are very small, and can be determined accurately from 
mass spectrum, and hence does not carry any sizable uncertainty. It has been demonstrated, that the detector 
scales and resolutions can be determined very accurately [195], and the associated uncertainties are therefore 
also close to negligible. 

Some detailed studies demonstrate that eventually the systematic error between 1-2% or even smaller might 
be achieved for the W and Z counting and within the detector acceptance up to rapidities of about 2.5. 

In order to use this number for the pp luminosity determination the total inclusive W and Z cross-section 
at NNLO can be used. These have been calculated to be 20510 pb and 2015pb, respectively [208]. Variations in 
models, floating parameters, and other theoretical uncertainties lead to significant variations in the estimates. 
The uncertainties on these calculation are estimated to be 5% or smaller. This uncertainty appears to be 
currently dominated by the PDF uncertainties needed to extrapolate to the experimentally uncovered large 
rapidity region. More discussions about these uncertainties can be found for example at [209] and [210]. 

It can be assumed that the detailed studies of the rapidity distributions within the acceptance region with 
W and Z decays might eventually lead to further error reductions. 

5.4.3 Measuring Z production, experimental approach in LHCb 

The uncertainty on the predicted Z production cross section at the LHC comes from two sources: the un- 
certainty on the NNLO partonic cross section prediction [208], which contributes an uncertainty of < 1%, 
and uncertainties in our understanding of the proton Parton Distribution Functions (PDFs) which, for the lat- 
est MSTW fit [39], contribute an uncertainty of ~ 3% for Z bosons produced with rapidities in the range 

-5 < y < 5. 

A measurement of the Z production rate at LHCb via the channel Z — > p + p~, which provides a final 
state that is both clean and fully reconstructible, can be achieved with high efficiency and little background 
contamination. In addition, since the dimuon trigger stream at LHCb [211] requires two muons with an 
invariant mass larger than 2.5GeV and a summed transverse momentum (Pj, + Pj,) greater than 1.4GeV, a 
high trigger efficiency of ~ 95% is expected for these events. A variety of background sources for this channel 
have been investigated: other electroweak processes such as Z — > t + t~ where both taus decay to muons and 
neutrinos; QCD processes such as bb — ► p + p~ +X; and events where two hadrons with an invariant mass near 
the Z mass are both mis-identified as muons. To deal with these backgrounds an off-line selection has been 
developed [212] that requires: the dimuon invariant mass to be within 20 GeV of the Z mass; the higher and 



lower transverse momentum muons to be greater than 20 GeV and 15 GeV respectively; the impact parameter 
of both muons is consistent with the primary vertex; and both muons have associated hadronic energy that 
is less than 50 GeV. For Z — > fi + fi~ events that are triggered and reconstructed at LHCb, these off-line 
selection criteria will select 91 ± 1% of the signal events while reducing the background to (3.0 ± 2.9)% of 
the signal level with the dominant contribution due to the combinatoric backgrounds from pion and kaon mis- 
identification. It is expected that these backgrounds can be well understood from real data or removed using 
muon isolation criteria. Overall it is expected that Z — > {J. + fJ>~ events will be triggered, reconstructed and 
selected at LHCb at a rate of ~ WOevts/pb -1 . Systematic uncertainties have also been investigated and it is 
expected that with as little as 5p6 _1 of data the experimental efficiency (trigger, tracking, muon identification 
etc.) can be measured with an uncertainty of ~ 1.5% enabling a luminosity measurement with an uncertainty 
of ~ 3.5%. 

5.4.4 PDF and relative parton-parton luminosity measurements 

Theoretically well understood reactions at the LHC offer the possibility to use their rapidity distributions to 
improve todays knowledge of PDFs. Especially the resonance production of W and Z bosons with lep tonic 
decays with low and high transverse momentum and the production of isolated high p t 7- Jet events have 
been demonstrated to be very sensitive to the relative parton distribution functions. Simulations from ATLAS 
and CMS have shown that experimental errors on these rapidity regions up to \y\ of about 2.5 can proba- 
bly performed with accuracies eventually reaching perhaps 1% or better. The possibility to cross-check the 
measurements with W and Z decays to (a) electron(s) and (b) muon(s) and between both experiments will of 
course help to reach the accuracy. 

During the past years simulation studies from the LHCb collaboration have shown that the experiment 
has a unique potential to extend the acceptance region from ATLAS and CMS for muons up to rapidity values 
at least up to 4.5. Furthermore, the existing overlap region for y between 1.9 and 2.5 should allow to reduce 
normalisation uncertainties. Obviously, these rapidity values are understood as being reasonably accurate but 
qualitative values and more precise values will be defined once real data will allow to define a well understood 
fiducial volume of the detectors. 

In addition, the LHCb collaboration has investigated the possibility to identify clean samples of very low 
mass Drell-Yan mu-pair events. The results indicate that such pairs can be measured within their acceptance 
region down to masses of 5 GeV. Such a measurement would in principle allow to measure PDFs for x values 
approaching extremely low values of 10~ 6 for the first time [213]. 

It should be clear that such measurements, which are known to be very sensitive to quark, antiquark 
and gluon relative parton luminosities will not allow an absolute PDF normalisation. Such an improvement of 
absolute PDF normalisation would require the accurate knowledge of the proton-proton luminosity to better 
than todays perhaps ±3% PDF accuracy obtained from the HERA measurements over a large x range and 
obviously lower Q 2 . The alternative approach to combine the relative parton luminosities over the larger x, Q 2 
range using the sum rules has, to our knowledge, so far not been studied in sufficient detail. 

A more detailed analysis of the different experimental approaches to improve the PDFs are interesting 
but are beyond the scope of this note about the luminosity. Nevertheless we hope that the experimentalists 
of the three collaboration will start to combine their efforts and will pursue the PDF measurements, in direct 
collaboration with theorists, during the coming years. 

5.5 Comparing the different pp luminosity measurements 

A relatively large number of pp luminosity measurements has been proposed and the most relevant have been 
discussed in this note. Here we try to give a critical overview of the different methods and their potential 
problems. Despite these advantages and disadvantage it should be clear that it is important to perform as 
many as possible independent luminosity methods during the coming years. 

• The machine luminosity determination using beam parameters: 

This method will be pursued independently of the experiments and its main purpose will be to opti- 
mize the performance of the LHC and thus providing a maximum number of physics collisions for the 



experiments. The potential to use this number as an almost instantaneous absolute luminosity number 
with uncertainties of perhaps ± 10% (and eventually ± 5%), assuming that non gaussian tails of the 
beam can be controlled to this accuracy will certainly be useful to the experiments. Of course the ex- 
periments would lose somewhat their "independence" and still need to combine this number with their 
actual active running time. 

However, one should remember that the Tevatron experiments did not use this method for their mea- 
surements. 

The method to determine the beam size using the LHCb precision vertex detector look very promising 
and it is hoped that their approach might result in a pp luminosity measurement with an associated 
uncertainty of 3-5%. 

• Total cross section and absolute luminosity normalisation with specialized far forward Detectors: 

The luminosity independent total pp cross section measurement is planned by the TOTEM collaboration 
and by the ALFA detector. Using these numbers both ATLAS and CMS plan to obtain the pp luminosity 
from the counting of the pp elastic scattering counting numbers from the forward detectors which thus 
depend on the knowledge of the total cross section measurement. In order to obtain this number some 
few weeks of special optics and low luminosity LHC running are required. As all LHC experiments 
are very keen to obtain as quickly as possible some reasonable luminosity at 14 TeV center of mass 
energy it is not likely that those special LHC data taking will happen during the first year(s) of data 
taking. Furthermore, despite the hope that the total cross section can be determined in principle with an 
interesting accuracy of ± 1%, it remains to be demonstrated with real LHC running. In this respect it 
is worth remembering that the two independent measurements of the total cross section at the Tevatron 
differed by 12% while much smaller errors were obtained by the individual experiments. As a result the 
average value with an error of ±6% was used for the luminosity normalisation. 

• Luminosity determination using Z — > it. 

This method provides an accurate large statistic relative luminosity number. It will be as accurate as the 
theoretical cross section calculation, which is based on the absolute knowledge of the PDFs from other 
experiments, from unknown higher order corrections and their incomplete Monte Carlo implementation. 
Todays uncertainties are estimated to be about 5%. It has been estimated, assuming the experiments 
perform as expected, that the potential Z counting accuracy within the acceptance region including 
efficiency corrections might quickly reach ±1%. The extrapolation to the uncovered rapidity space, 
mainly due to the worse knowledge of the PDFs in this region, increases the error to perhaps 3%. Taking 
other theoretical uncertainties into account an error of ±5% is currently estimated. Of course, advocates 
of the Z normalisation method like to point out that the real power of this method starts once relative 
measurements, covering similar partons and similar ranges of the parton distribution functions will be 
performed with statistical errors below 5%. Examples where such a normalization procedure looks 
especially interesting are the relative cross section measurements of N(Z)/N(W), N(W + )/N(W~), 
high mass Drell-Yan events with respect to Z events and diboson final states decaying to leptons. Of 
course, correlations and anticorrelations between quark and gluon dominated production rates exist 
and need to be carefully investigated before similar advantages for the gluon PDFs can eventually be 
exploited. The loss of an independent Z cross section measurement would of course be a fact of life. 

• pp luminosity from the reaction pp — > pp/^/u: 

A measurement of this reaction offers in principle a direct and theoretically accurate proton proton 
luminosity value. Unfortunately current simulations from the experiments indicate that the accepted 
cross section is relatively small and only a few 1000 events can be expected per fb _1 . The different 
simulation results indicate that the backgrounds can be suppressed sufficiently without increasing the 
experimental systematics too much. The current simulation results indicate that small systematic errors 
of perhaps 1-2% might eventually be achievable 53 once a yearly luminosity of 5-10 fb" 1 in ATLAS and 
CMS (2 fb _1 for LHCb) might be recorded. It remains to be seen if muons with transverse momenta 
well below 20 GeV can indeed be measured as accurately as muons with transverse momenta above 25 
GeV. 



"It might be interesting to study the experience from similar measurements at the experimentally ideal conditions of LEP, where 
uncertainties above ± 3% have been reported [205]. 



5.5.1 Which luminosity accuracy might be achievable and when 

Of course the potential time dependent accuracy of the different luminosity methods can only be guessed 
today as such numbers depend obviously on the LHC machine performance during the coming years. For the 
purpose of this Section we are mainly interested in measurements at the 14 TeV center of mass energy and 
assume that the following "data samples" would define such "years". Of course, it could be hoped that the 
luminosity and energy increase would go much faster resulting in "some" shorter LHC years. Thus we assume 
that the first 14 TeV year, currently expected to be 2010, will correspond to 0.1 fb" 1 , followed by a 1 fb _1 
year. During the third and fourth year ATLAS and CMS expect to collect about 5 fb" 1 and 10 fb _1 while 
LHCb expects to collect roughly 2 fb _1 per year. We assume further that the special optics low luminosity 
data taking periods requiring perhaps a few weeks for TOTEM and similar for ALFA will take only place 
during the year when more than 1 fb _1 per year or more can be expected. 

As a result, for the first two 14 TeV running years, realistic luminosity numbers could come from (1) 
the machine group and (2) from the indirect method using the inclusive production of Z events with leptonic 
decays. 

As has been pointed out in Section [5.3. II the method (1) would, without any additional efforts by the 
machine group, allow a first estimate with a ± 20-30% luminosity accuracy. We assume however that, due to 
the delay of the real 14 TeV start to 2010, enough resources could be found that people within the machine 
group could carefully prepare for the necessary beam parameter measurements and that the experiments will 
do the corresponding efforts to correct such a machine luminosity number for real detector data taking one 
could hope for a 10% measurement for 2010 and a 5% accuracy for 2011. 

In contrast, method (2) would by definition be an integrated part of any imaginable experimental LHC 
data taking period. In fact, if enough attention is put into the Z counting method, the data expected during 
2010 running might already reach statistical errors of ± 2% per 5 pb _1 periods. Thus perhaps about 10-20 
such periods could be defined during the entire year and systematic errors for the lepton efficiency correction 
within the detector acceptance could reach similar ± 2-3% accuracies. During the following years these 
errors might decrease further to 1% or better. Once the rate of any "stable" simple high rate final states and 
even trigger rates relative to the Z counting rate has been determined, such relative event rates can be used 
subsequently to track the "run" luminosity and even the real time luminosity with similar accuracy. 

Theoretical limitations of the cross section knowledge, not expected to improve without LHC data tak- 
ing, would limit the accuracy to about ± 5%. The expected detailed analysis of the 2010 rapidity distributions 
of W, Z and 7-jet events will allow some improvements for the years 201 1 and beyond. We can thus expect 
that appropriate ratio measurements like the cross section ratio measurements of Z/W and W~ /W + will 
already reach systematic accuracies of ± 1-2% during 2010 and 1% or better in the following years. Mea- 
surement of b physics, either in LHCb or in ATLAS and CMS might in any case prefer to perform luminosity 
independent measurements and relate any of the "new" measurements to some relatively well known and 
measurable B-hadron decays. 

It is also worth pointing out that currently no other high Q 2 reaction has been envisioned, which might 
be measurable to a systematic precision of better than 5-10% and a luminosity of up to lfb -1 . In addition, 
most of the interesting high Q 2 electroweak final states will unfortunately even be limited for the first few 
LHC years to statistical accuracies to 5% or more. 

The prospect for the other luminosity measurements start to become at earliest interesting only once a 
few 100 pb" 1 can be recorded. Consequently one can expect to obtain a statistical interesting accuracy from 
the reaction pp — > pp\i[i after 2010. Similar, it looks unlikely that low luminosity special optics run will 
be performed before 2011. Consequently one might hope that few % accurate total cross section numbers 
become available before the 2012 data taking period will start. 

5.6 Summary and Outlook 

A large variety of potentially interesting pp luminosity measurements, proposed during the past 10-15 years, 
are presented in this Section. 

Realistically only the machine luminosity measurement and the counting of the Z production might 



reach interesting accuracies of 5% before 2011. For all practical puiposes it looks that both methods should 
be prepared in great detail before the data taking at 14 TeV collision energies will start in 2010. 

We believe that a working group, consisting of interested members of the three pp collider experiments 
and interested theorists, should be formed to prepare the necessary Monte Carlo tools to make the best possible 
use of the soon expected W and Z data, not only for the pp luminosity normalization but even more for the 
detailed investigations of the parton parton luminosity determination and their use to predict other event rates 
for diboson production processes and high mass Drell-Yan events. 
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This document demonstrates the vast amount of progress that has taken place in the last years on pinning down 
the PDFs of the proton, as well as the dramatic increase in awareness of the impact of PDFs on the physics 
program of LHC experiments. The HERALHC workshop has acted as a regular forum for working meetings 
between the experiments, PDF phenomenologists and theorists. In the course of this workshop, it was realized 
that the momentum on the PDF studies should be kept and perhaps even focused more on the LHC, in order 
to continue the discussions, investigations and further work towards improving our knowledge on the PDFs. 

Clearly, LHC will need the best PDFs, especially for precision measurements, setting of limits in 
searches, and even for discoveries. Ideally the ATLAS and CMS (and LHCb and ALICE) analyses should 
follow a common procedure for using PDFs and their uncertainties in their key analyses. Such a common 
procedure, across the experiments, is being used in other contexts, such as significance estimates in searches. 
Also, changing frequently the PDFs in the software of the experiments, e.g. for cross-checks or the determina- 
tion of error bands, is often non-trivial (e.g. due to the inter-connection with parameter choices for underlying 
event modeling, showering parameters and so on) and sometimes impractical if CPU intensive detector simu- 
lations are involved. LHC studies therefore will need both good central values for the PDFs to start with, and 
a good estimate of the associated uncertainties. 

This has triggered the so called PDF4LHC initiative. PDF4LHC offers a discussion forum for PDF 
studies and information exchange between all stake-holders in the field: the PDF global fitter groups, such 
as CTEQ and MSTW; the current experiments, such as the HERA and Tevatron ones; QCD theorists and 
the LHC experimental community. The PDF4LHC initiative started in 2008. More details and links to the 
meetings so far can be found on the PDF4LHC web site [214]. 

The mission statement of PDF4LHC is: 

• Getting the best PDFs, including the PDF uncertainties, based on the present data. 

• Devise strategies to use future LHC data to improve the PDFs. 

All this needs a close collaboration between theorists and those that are preparing to make the measurements. 
In order to reach the first goal, the PDF4LHC forum aims to stimulate discussions and trigger further compar- 
ison exercises across the PDF community, in order to select one or a limited number of possible strategies that 
can be adapted to determine and use PDFs. For the second goal, PDF4LHC should also be a forum for discus- 
sions on how to include measurements from the LHC to constrain PDFs: what should be measured at LHC, 
and correspondingly calculated in theory. Such measurements include W and Z production and asymmetries, 
di-jet production, hard prompt photons, Drell-Yan production, bottom and top quark production, Z-shape fits 
and Z+jets measurements. One expects that some of these channels can already be studied with first data, 
hence we need to prepare for that well in advance. 

The following issues are part of the program for in depth discussions via topical workshops, some of 
which took place already in 2008 [214]. 

• Data to be included in the PDFs. Would we get better results with a selection of data to be used? New 
data will become available such as Fl(x, Q 2 ), and combined data from Hl/ZEUS. Can we extract more 
from the data? 

• Determination of PDF uncertainties, including the statistical treatment of the data. 

• Theoretical uncertainties and regions/processes where they matter: higher-order corrections; heavy 
flavour treatment; low-x (and high-x) resummation; other PDFs like unintegrated PDFs (and GPDs). 

• PDFs for usage Monte Carlo generators. 

One can expect that the LHC experiments most likely will be using for most of their studies the PDF 
sets and errors that are delivered by either one of the CTEQ or MSTW family. Hence it is important that 
the lessons learned from exercises on studies of the systematics on PDFs will be adapted by these main 
global PDF providers. PDF4LHC aims to advice the experiments in the use for PDFs for the LHC, based on 
the discussions, results and future consensus at the forum. The experience and results from HERAPDFs, and 
PDFs from other groups, like the Neural Net or Alekhin ones are extremely valuable in this discussion and will 
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serve as crucial input in studies to demonstrate how well we actually know the partem distributions. Several 
important benchmark exercises have been already performed and are reported in section 3 of this report. 

A special case are the PDFs for Monte Carlo generators. For experiments it is important that gener- 
ated events be kinematically distributed close to the distribution of the real data, such that the simulated and 
reconstructed Monte Carlo events can be used in a straightforward way to calculate efficiencies for e.g. experi- 
mental cuts in an analysis. In case the initially generated distribution does not resemble the data close enough, 
the Monte Carlo samples need to be reweighted, with all its possible drawbacks. Since calculations based on 
LO Matrix Elements and LO PDFs are known not to describe the data well, and NLO Matrix Element based 
generators to date have so far only a restricted number of processes implemented, studies are ongoing on so 
called "improved LO" PDFs, which try to cure some of the LO PDF drawbacks. Examples are given in [215]. 
This is yet another part of the discussions in the PDF4LHC forum 

In short, it is crucial that the work started here continues, with discussions and studies on PDFs and 
their uncertainties, the impact of the upcoming data on future PDF determinations and more, all with special 
focus on the needs for the LHC. The PDF4LHC initiative will offer a framework to do all this. 
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